PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 

lutcniuiionsl Bureau 



(51) International Patent Classification 6 : 

A01H 5/00, C07H 21/00, C12N 15/00, 
15/82 



Al 



(11) International Publication Number: WO 98/54954 

(43) International Publication Date: 10 December 1998 (10.12.98) 



(21) International Application Number: PCT/US98/I 1384 

(22) International Filing Date: 1 June 1998 (01.06.98) 



(30) Priority Data: 

08/868,373 



3 June 1997 (03.06.97) 



US 



(71) Applicant (for all designated States except US): CAR GILL, 
INCORPORATED [US/US]; 10547 West McGinty Road, 
Wayzata, MN 55391 (US). 

(71)(72) Applicants and Inventors: JAWORSKI, Jan, G. [US/US1; 
425 Emerald Woods Drive, Oxford, OH 45058 (US). 
POST-BEITTENMILLER, Martha, Ann [US/US]; 2375 
Quail Road, Ardmore. OH 73491 (US). TODD, James 
[US/US]; 17 Kelly Drive, Oxford, OH 45056 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB BG BR 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, Gb' GE 
GH t GM, GW, HU, ID, IL, IS, JP, KE, KG, KP, KR* KZ* 
LC. LK, LR, LS, LT, LU, LV, MD, MG, MK. MN MW* 
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK SL 
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB GR 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, Cf[ 
CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(74) Agent: LUNDQUIST, Ronald, C; Fish & Richardson P.C., 
P.A.. Suite 3300, 60 South Sixth Street, Minneapolis, MN 
55402 (US). 



(54) Title: FATTY ACID ELONGASES 



(57) Abstract 



Nucleic acids are disclosed that encode fatty acid 0-keto acyl synthases from plants. Such synthases are effective for producig 
very long chain fatty acids (VLCFA), e.g., C22 to C26, preferentially saturated but also monounsaturated. Also disclosed are polypeptides 
encoded by such nucleic acids. Transgenic plants expressing these polypeptides exhibit altered levels of VLCFA in one or more tissues, 
such as seeds or leaves. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


Prance 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Paso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vie* Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


C! 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


no 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






OK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 98/54954 



PCT/US98/1 1384 



FATTY ACID ELONGASES 
Field of the Invention 
5 This invention relates to fatty acid elongase 

complexes and nucleic acids encoding elongase proteins. 
More particularly, the invention relates to nucleic acids 
encoding fi-keto acyl synthase proteins that are effective 
for producing very long chain fatty acids, polypeptides 
10 produced from such nucleic acids and transgenic plants 
expressing such nucleic acids. 

Background of the Invention 
Plants are known to synthesize very long chain 
fatty acids (VLCFAs) . VLCFAs are saturated or 

15 unsaturated monocarboxylic acids with an unbranched even- 
numbered carbon chain that is greater than 18 carbons in 
length. Many VLCFAs are 20-32 carbons in length, but 
VLCFAs can be up to 60 carbons in length. Important 
VLCFAs include erucic acid (22:1, i.e., a 22 carbon chain 

20 with one double bond), nervonic acid (24:1), behenic acid 
(22:0), and arachidic acid (20:0). 

Plant seeds accumulate mostly 16- and 18-carbon 
fatty acids. VLCFAs are not desirable in edible oils. 
Oilseeds of the Cmcifereae (e.g., rapeseed) and a few 

25 other plants, however, accumulate C20 and C22 fatty acids 
(FAs) . Although plant breeders have developed rapeseed 
lines that have low levels of VLCFAs for edible oil 
purposes, even lower levels would be desirable. On the 
other hand, vegetable oils having elevated levels of 

30 VLCFAs are desirable for certain industrial uses, 

including uses as lubricants, fuels and as a feedstock 
for plastics, pharmaceuticals and cosmetics. 
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The biosynthesis of saturated fatty acids up to an 
18-carbon chain occurs in the chloroplast. C2 units from 
acyl thioesters are linked sequentially, beginning with 
the condensation of acetyl Coenzyme A (CoA) and malonyl 
5 acyl carrier protein (ACP) to form a C4 acyl fatty acid. 
This condensation reaction is catalyzed by a S-ketoacyl 
synthase III (KASIII) . S-ketoacyl moieties are also 
referred to as 3-ketoacyl moieties. 

The enzyme S-ketoacyl synthase I (KASI) is 

10 involved in the addition of C2 groups to form the C6 to 
C16 saturated fatty acids. KASI catalyzes the stepwise 
condensation of a fatty acyl moiety (C4 to C14) with 
malonyl -ACP to produce a 3-ketoacyl -ACP product that is 2 
carbons longer than the substrate. The last condensation 

15 reaction in the chloroplast, converting C16 to C18, is 
catalyzed by S-ketoacyl synthase II (KASII) . 

Each elongation cycle involves three additional 
enzymatic steps in addition to the condensation reaction 
as discussed above. Briefly, the S-ketoacyl condensation 

20 product is reduced to S-hydroxyacyl-ACP, dehydrated to 
the enoyl-ACP, and finally reduced to a fully reduced 
acyl -ACP. The fully reduced fatty acyl -ACP reaction 
product then serves as the substrate for the next cycle 
of elongation. 

25 The C18 saturated fatty acid (stearic acid, 18:0) 

can be transported out of the chloroplast and converted 
to the monounsaturate C18:l (oleic acid), and the 
polyunsaturates C18:2 (linoleic acid) and C18:3 (<*- 
linolenic acid). C18:0 and C18:l can also be elongated 

30 outside the chloroplast to form VLCFAs. The formation of 
VLCFAs involves the sequential condensation of two carbon 
groups from malonyl CoA with a C18:0 or C18:l fatty acid 
substrate. Elongation of fatty acids longer than 18 
carbons depends on the activity of a fatty acid elongase 

35 complex to carry out four separate enzyme reactions 
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similar to those described above for fatty acid synthesis 
in the chloroplast. Fehling, Biochem. Biophys. Acta 
1082:239-246 (1991). In plants, elongase complexes are 
distinct from fatty acid synthases since elongases are 
5 extraplastidial and membrane bound. 

Mutations have been identified in an Arabidopsis 
gene associated with fatty acid elongation. This gene, 
designated the FAE1 gene, is involved in the condensation 
step of an elongation cycle. See, WO 96/13582, 

10 incorporated herein by reference* Plants carrying a 

mutation in FAE1 have significant decreases in the levels 
of VLCFAs in seeds. Genes associated with wax 
biosynthesis in jojoba have also been cloned and 
sequenced (WO 95/15387, incorporated herein by 

15 reference) . 

Very long chain fatty acids are key components of 
many biologically important compounds in animals, plants, 
and microorganisms. For example, in animals, the VLCFA 
arachidonic acid is a precursor to many prostaglandins. 

20 In plants VLCFAs are major constituents of 

triacylglycerols in many seed oils, are essential 
precursors for cuticular wax production, and are utilized 
in the synthesis of glycosylceramides, an important 
component of the plasma membrane. 

25 Obtaining detailed information on the biochemistry 

of KAS enzymes has been hampered by the difficulties 
encountered when purifying membrane bound enzymes. 
Although elongase activities have been partially purified 
from a number of sources, or studied using cell 

30 fractions, the elucidation of the biochemistry of 

elongase complexes has been hampered by the complexity of 
the membrane fractions used as the enzyme source. For 
example, until recently, it was unclear as to whether 
plant elongase complexes were composed of a 

35 multifunctional polypeptide similar to the FAS found in 
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animals and yeast, or if the complexes existed as 
discrete and dissociable enzymes similar to the FAS of 
plants and bacteria. Partial purification of an elongase 
KAS, immunoblot identification of the hydroxy acyl 
5 dehydrase, and the recent cloning of a KAS gene {FAE1) 
suggest that the enzyme activities of elongase complexes 
exist on individual enzymes. 

Summary of the Invention 
The invention disclosed herein relates to an 

10 isolated polynucleotide selected from one of the 

following: SEQ ID NO:l; SEQ ID NO: 3; SEQ ID NO: 5; SEQ ID 
NO: 7; SEQ ID NO: 9; SEQ ID NO: 11; SEQ ID NO: 13; an RNA 
analog of SEQ ID NO:l, 3, 5, 7, 9, 11, 13, or 15; and a 
polynucleotide having a nucleic acid sequence 

15 complementary to one of the above. The polynucleotide 
can also be a nucleic acid fragment of one of the above 
sequences that is at least 15 nucleotides in length and 
that hybridizes under stringent conditions to genomic DNA 
encoding the polypeptide of SEQ ID N0:2, SEQ ID NO:4, SEQ 

20 ID N0:6, SEQ ID N0:8, SEQ ID NO: 10, SEQ ID N0:12, or SEQ 
ID NO: 14. 

Also disclosed herein is an isolated polypeptide 
that has an amino acid sequence substantially identical 
to one of the following: SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 

25 NO:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID NO:12, or SEQ ID 
NO: 14. Also disclosed are isolated polynucleotides 
encoding polypeptides substantially identical in their 
amino acid sequence to: SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID NO:12, or SEQ ID 

3 0 NO: 14. 

The invention also relates to a transgenic plant 
containing a nucleic acid construct . The nucleic acid 
construct comprises a polynucleotide described above. 
The construct further comprises a regulatory element 



WO 98/54954 



PCT/US98/11384 



- 5 - 

operably linked to the polynucleotide. The regulatory 
element may a tissue -specific promoter, for example, an 
epidermal cell -specif ic promoter or a seed-specific 
promoter. The regulatory element may be operably linked 
5 to the polynucleotide in sense or antisense orientation. 
The plant has altered levels of very long chain fatty 
acids in tissues where the polynucleotide is expressed, 
compared to a parental plant lacking the nucleic acid 
construct . 

10 A method is disclosed for altering the levels of 

very long chain fatty acids in a plant. The method 
comprises the steps of creating a nucleic acid construct 
and introducing the construct into the plant. The 
construct includes a polynucleotide selected from one of 

15 the following: SEQ ID N0:1; SEQ ID NO: 3; SEQ ID NO: 5; SEQ 
ID N0:7; SEQ ID NO:9; SEQ ID "NO: 11; SEQ ID NO:13; an RNA 
analog of SEQ ID N0:1, 3, 5, 7, 9, 11, 13, or 15; and a 
polynucleotide having a nucleic acid sequence 
complementary to one of the above. The polynucleotide 

20 can also be a nucleic acid fragment of one of the above 
that is at least 15 nucleotides in length and that 
hybridizes under stringent conditions to genomic DNA 
encoding the polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID N0:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, or SEQ 

25 ID N0:14. The polynucleotide is effective for altering 
the levels of very long chain fatty acids in the plant. 

Other features and advantages of the invention 
will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

30 Brief Description of the Drawings 

Figure 1 shows the time course of in vitro VLCFA 
synthesis by FAE1 expressed in yeast, with 3 different 
acyl-CoA substrates. 
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Figure 2 shows the rates of in vitro VLCFA 
synthesis and the VLCFA profiles of FAB1 expressed in 
yeast, with 3 different acyl-CoA substrates. 

Figure 3 shows the nucleotide sequence of the 
5 coding region of the Arabidopsis ELI polynucleotide (SEQ 
ID N0:1) . 

Figure 4 shows the deduced amino acid sequence 
(SEQ ID NO: 2) for the ELI coding sequence of Figure 3. 
Figure's shows the nucleotide sequence of the 
10 coding region of the Arabidopsis EL2 polynucleotide (SEQ 
ID NO:3) . 

Figure 6 shows the deduced amino acid sequence 
(SEQ ID NO: 4) for the EL2 coding sequence of Figure 5. 
Figure 7 shows the nucleotide sequence of the 
15 coding region of the Arabidopsis EL3 polynucleotide (SEQ 
ID N0:5) . 

Figure 8 shows the deduced amino acid sequence 
(SEQ ID NO: 6) for the EL3 coding sequence of Figure 7. 
Figure 9 shows the nucleotide sequence of the 

2 0 coding region of the Arabidopsis EL4 polynucleotide (SEQ 

ID NO:7) . 

Figure 10 shows the deduced amino acid sequence 
(SEQ ID NO: 8) for the EL4 coding sequence of Figure 9. 

Figure 11 shows the nucleotide sequence of the 
25 coding region of the Arabidopsis ELS polynucleotide (SEQ 
ID NO:9) . 

Figure 12 shows the deduced amino acid sequence 
(SEQ ID NO: 10) for the EL5 coding sequence of Figure 11. 
Figure 13 shows the nucleotide sequence of the 

3 0 coding region of the Arabidopsis EL6 polynucleotide (SEQ 

ID NO: 11) • 

Figure 14 shows the deduced amino acid sequence 
(SEQ ID NO: 12) for the EL6 coding sequence of Figure 13. 
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Figure 15 shows the nucleotide sequence of the 
coding region of the Arabidopsis EL7 polynucleotide (SEQ 
ID N0:13) . 

Figure 16 shows the deduced amino acid sequence 
5 (SEQ ID NO: 14) for the EL7 coding sequence of Figure 15. 



Description of the Preferred Embodiments 
The present invention comprises isolated nucleic 
acids (polynucleotides) that encode polypeptides having 
fi-ketoacyl synthase activity. The novel polynucleotides 

10 and polypeptides of the invention are involved in the 
synthesis of very long chain fatty acids and are useful 
for modulating the total amounts of such fatty acids and 
the specific VLCFA profile in plants. 

A polynucleotide of the invention may be in the 

15 form of RNA or in the form of DNA, including cDNA, 

synthetic DNA or genomic DNA. The DNA may be double - 
stranded or single -stranded, and if single-stranded, can 
be either the coding strand or non- coding strand. An RNA 
analog may be, for example, mRNA or a combination of 

20 ribo- and deoxyribonucleotides . Illustrative examples of 
a polynucleotide of the invention are shown in Figs. 3, 
5, 7, 9, 11, 13 and 15. 

A polynucleotide of the invention typically is at 
least 15 nucleotides (or base pairs, bp) in length. In 

25 some embodiments, a polynucleotide is about 20 to 100 

nucleotides in length, or about 100 to 500 nucleotides in 
length. In other embodiments, a polynucleotide is 
greater than about 1500 nucleotides in length and encodes 
a polypeptide having the amino acid sequence shown in 

30 Figs. 4, 6, 8, 10, 12, 14 or 16. 

In some embodiments, a polynucleotide of the 
invention encodes analogs or derivatives of a polypeptide 
having the deduced amino acid sequence of Figs. 4, 6, 8, 
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10, 12, 14 or 16. Such fragments, analogs on derivatives 
include, for example, naturally occurring allelic 
variants, non-naturally occurring allelic variants, 
deletion variants and insertion variants, that do not 
5 substantially alter the function of the polypeptide. 

A polynucleotide of the invention may further 
comprise additional nucleic acids. For example, a 
nucleic acid fragment encoding a secretory or leading 
amino acid sequence can be fused in- frame to the amino 

10 terminal end of one of the ELI through EL7 polypeptides. 
Other nucleic acid fragments are known in the art that 
encode amino acid sequences useful for fusing in- frame to 
the KAS polypeptides disclosed herein. See, e.g., U.S. 
5,62 9,193 incorporated herein by reference. A 

15 polynucleotide may further comprise one or more 
regulatory elements operably linked to a KAS 
polynucleotide disclosed herein. 

The present invention also comprises 
polynucleotides that hybridize to a KAS polynucleotide 

20 disclosed herein. Such a polynucleotide typically is at 
least 15 nucleotides in length. Hybridization typically 
involves Southern analysis (Southern blotting) , a method 
by which the presence of DNA sequences in a target 
nucleic acid mixture are identified by hybridization to a 

25 labeled oligonucleotide or DNA fragment probe. Southern 
analysis typically involves electrophoretic separation of 
DNA digests on agarose gels, denaturation of the DNA 
after electrophoretic separation, and transfer of the DNA 
to nitrocellulose, nylon, or another suitable membrane 

30 support for analysis with a radiolabeled, biotinylated, 
or enzyme-labeled probe as described in sections 9.37- 
9.52 of Sambrook et al , , (1989) Molecular Cloning, second 
edition, Cold Spring Harbor Laboratory, Plainview; NY. 
A polynucleotide can hybridize under moderate 

35 stringency conditions or, preferably, under high 



WO 98/54954 



PCT/US98/11384 



- 9 - 

stringency conditions to a KAS polynucleotide disclosed 
herein. High stringency conditions are used to identify 
nucleic acids that have a high degree of homology to the 
probe. High stringency conditions can include the use of 
5 low ionic strength and high temperature for washing, for 
example, 0.015 M NaCl/0.0015 M sodium citrate (0.1X SSC); 
0.1% sodium lauryl sulfate (SDS) at 65°C. Alternatively, 
a denaturing agent such as formamide can be employed 
during hybridization, e.g., 50% formamide with 0.1% 

10 bovine serum albumin/0.1% Ficoll/0.1% 

polyvinylpyrrolidone/ 50 mM sodium phosphate buffer at pH 
6.5 with 750 mM NaCl, 75 mM sodium citrate at 42°C. 
Another example is the use of 50% formamide, 5 x SSC 
(0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium 

15 phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA (50 
/ig/ml) , 0.1% SDS, and 10% dextran sulfate at 42°C, with 
washes at 42 °C in 0.2 x SSC and 0.1% SDS. 

Moderate stringency conditions refers to 

20 hybridization conditions used to identify nucleic acids 
that have a lower degree of identity to the probe than do 
nucleic acids identified under high stringency 
conditions. Moderate stringency conditions can include 
the use of higher ionic strength and/or lower 

25 temperatures for washing of the hybridization membrane, 
compared to the ionic strength and temperatures used for 
high stringency hybridization. For example, a wash 
solution comprising 0.060 M NaCl/0.0060 M sodium citrate 
(4X SSC) and 0.1% sodium lauryl sulfate (SDS) can be used 

30 at 50°C, with a last wash in IX SSC, at 65°C. 

Alternatively, a hybridization wash in IX SSC at 37°C can 
be used. 

Hybridization can also be done by Northern 
analysis (Northern blotting) , a method used to identify 
35 RNAs that hybridize to a known probe such as an 
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oligonucleotide, DNA fragment, cDNA or fragment thereof, 
or RNA fragment. The probe is labeled with a 
radioisotope such as 32 P, by biotinylation or with an 
enzyme. The RNA to be analyzed can be usually 
5 elect rophoretically separated on an agarose or 

polyacrylamide gel, transferred to nitrocellulose, nylon, 
or other suitable membrane, and hybridized with the 
probe, using standard techniques well known in the art 
such as those described in sections 7.39-7.52 of Sambrook 

10 et al . , supra. 

A polynucleotide has at least about 70% sequence 
identity, preferably at least about 80% sequence 
identity, more preferably at least about 90% sequence 
identity to SEQ ID NO:l, 3, 5, 7, 9, 11, or 13. Sequence 

15 identity can be determined, for example, by computer 

programs designed to perform single and multiple sequence 
alignments . 

A polynucleotide of the invention can be obtained 
by chemical synthesis, isolation and cloning from plant 

2 0 genomic DNA or other means known to the art, including 
the use of PCR technology carried out using 
oligonucleotides corresponding to portions of SEQ ID 
NO:l, 3, 5, 7-9, 11 or 13. Polymerase chain reaction 
(PCR) refers to a procedure or technique in which target 

25 nucleic acid is amplified in a manner similar to that 
described in U.S. Patent No. 4,683,195, incorporated 
herein by reference, and subsequent modifications of the 
procedure described therein. Generally, sequence 
information from the ends of the region of interest or 

30 beyond is employed to design oligonucleotide primers that 
are identical or similar in sequence to opposite strands 
of the template to be amplified. PCR can be used to 
amplify specific RNA sequences, specific DNA sequences 
from total genomic DNA, and cDNA transcribed from total 

35 cellular RNA, bacteriophage or plasmid sequences, and the 
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like. Alternately, a cDNA library (in an expression 
vector) can be screened with KAS-specific antibody 
prepared using peptide sequence (s) from hydrophilic 
regions of the KAS protein of SEQ ID NO: 2 and technology 
5 known in the art . 

A polypeptide of the invention comprises an 
isolated polypeptide having the deduced amino acid 
sequence of Figs. 2, 4, 6, 8, 10 and 12, as well as 
derivatives and analogs thereof. By "isolated" is meant 

10 a polypeptide that is expressed and produced in an 
environment other than the environment in which the 
polypeptide is naturally expressed and produced. For 
example, a plant polypeptide is isolated when expressed 
and produced in bacteria or fungi. Similarly, a plant 

15 polypeptide is isolated when its gene coding sequence is 
operably linked to a chimeric regulatory element and 
expressed in a tissue where the polypeptide is not 
naturally expressed. A polypeptide of the invention also 
comprises variants of the KAS polypeptides disclosed 

20 herein, as discussed above. 

A full-length KAS coding sequence may comprise the 
sequence shown in SEQ ID NO:l r 3, 5, 7, 9, 11 or 13. 
Alternatively, a chimeric full-length KAS coding sequence 
may be formed by linking, in- frame, nucleotides from the 

25 5' region of a first KAS gene to nucleotides from the 3' 
region of a second KAS gene, thereby forming a chimeric 
KAS protein. 

It should be appreciated that nucleic acid 
fragments having a nucleotide sequence other than the KAS 

3 0 sequences disclosed in SEQ ID NO:l, 3, 5, 7, 9, 11 or 13 
will encode a polypeptide having the exemplified amino 
acid coding sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12 or 
14, respectively. The degeneracy of the genetic code is 
well-known to the art; i.e., for many amino acids, there 
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is more than one nucleotide triplet which serves as the 
codon for the amino acid. 

It should also be appreciated that certain amino 
acid substitutions can be made in protein sequences 
5 without affecting the function of the protein. 

Generally, conservative amino acid substitutions or 
substitutions of similar amino acids are tolerated 
without affecting protein function. Similar amino acids 
can be those that are similar in size and/or charge 

10 properties, for example, aspartate and glutamate and 
isoleucine and valine are both pairs of similar amino 
acids. Similarity between amino acid pairs has been 
assessed in the art in a number of ways. For example, 
Dayhoff et al . (1978) in Atlas of Protein Sequence and 

15 Structure, Vol. 5, Suppl . 3, pp. 345-352, which is 
incorporated by reference herein, provides frequency 
tables for amino acid substitutions which can be employed 
as a measure of amino acid similarity. 

A nucleic acid construct of the invention 

20 comprises a polynucleotide as disclosed herein linked to 
another, different polynucleotide. For example, a full- 
length KAS coding sequence may be operably fused in- frame 
to a nucleic acid fragment that encodes a leader 
sequence, secretory sequence or other additional amino 

25 acid sequences that amy be usefully linked to a 
polypeptide or peptide fragment. 

A transgenic plant of the invention contains a 
nucleic acid construct as described herein. In some 
embodiments, a transgenic plant contains a nucleic acid 

3 0 construct that comprises a polynucleotide of the 
invention operably linked to at least one suitable 
regulatory sequence in sense orientation. Regulatory 
sequences typically do not themselves code for a gene 
product. Instead, regulatory sequences affect the 

35 expression level of the polynucleotide to which they are 
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linked. Examples of regulatory sequences are known in 
the art and include, without limitation, minimal 
promoters and promoters of genes preferentially or 
exclusively expressed in seeds or in epidermal cells of 
5 stems and leaves. Native regulatory sequences of the 
polynucleotides disclosed herein can be readily isolated 
by those skilled in the art and used in constructs of the 
invention. Other examples of suitable regulatory 
sequences include enhancers or enhancer- like elements, 

10 introns, 3' non-coding regions such as poly A sequences 
and other regulatory sequences discussed herein. 
Molecular biology techniques for preparing such chimeric 
genes are known in the art. 

In other embodiments, a transgenic plant contains 

15 a nucleic acid construct comprising a partial or a full- 
length KAS coding sequence operably linked to at least 
one suitable regulatory sequence in antisense 
orientation. The chimeric gene can be introduced into a 
plant and transgenic progeny displaying expression of the 

2 0 antisense construct are identified. 

One may use a polynucleotide disclosed herein for 
cosuppression as well as for antisense inhibition. 
Cosuppression of genes in plants may be achieved by 
expressing, in the sense orientation, the entire or 
25 partial coding sequence of a gene. See, e.g., WO 
04\11516, incorporated herein by reference. 

Transgenic techniques for use in the invention 
include, without limitation, Agrobacteri um-mediated 
transformation, viral vector-mediated transformation 

3 0 electroporation and particle gun transformation. 

Illustrative examples of transformation techniques are 
described in U.S. Patent 5,204,253, (particle gun) and 
U.S. Patent 5,188,958 (Agrrobacterium) , incorporated 
herein by reference. Transformation methods utilizing 
35 the Ti and Ri plasmids of AgroJbacterium spp. typically 
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use binary-type vectors. Walkerpeach, C. et al . , in 
Plant Molecular Biology Manual, S. Gelvin and R. 
Schilperoort , eds., Kluwer Dordrecht, Cl:l-19 (1994). If 
cell or tissue cultures are used as the recipient tissue 
5 for transformation, plants can be regenerated from 

transformed cultures by techniques known to those skilled 
in the art . 

Techniques are known for the introduction of DNA 
into monocots as well as dicots, as are the techniques 

10 for culturing such plant tissues and regenerating those 
tissues. Monocots which have been successfully 
transformed and regenerated include wheat, corn, rye, 
rice, and asparagus. See, e.g., U.S. Patent Nos. 
5,484,956 and 5,550,318, incorporated herein by 

15 reference. 

For efficient production of transgenic plants from 
plant cells, it is desirable that the plant tissue used 
for transformation possess a high capacity for 
regeneration. Transgenic plants of woody species such as 

2 0 poplar and aspen have also been obtained. Technology is 

also available for the manipulation, transformation, and 
regeneration of gymnosperm plants. For example, U.S. 
Patent No. 5,122,466 describes the biolistic 
transformation of conifers, with preferred target tissue 

25 being meristematic and cotyledon and hypocotyl tissues. 
U.S. Patent No. 5,041,382 describes enrichment of conifer 
embryonal cells. 

Seeds produced by a transgenic plant (s) can be 
grown and then selfed (or outcrossed and selfed) to 

30 obtain seeds homozygous for the construct. Seeds can be 
analyzed in order to identify those homozygotes having 
the desired expression of the construct. Transgenic 
plants may be entered into a breeding program, e.g., to 
introgress the novel construct into other lines, to 

3 5 transfer the construct to other species, or for further 
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selection of other desirable traits. Alternatively, 
transgenic plants may be propagated vegetatively for 
those species amenable to such techniques. A nucleic 
acid construct of the invention can alter the levels of 
5 very long chain fatty acids in plant tissues expressing 
the polynucleotide, compared to VLCFA levels in 
corresponding tissues from an otherwise identical plant 
not expressing the polynucleotide. A comparison can be 
made, for example, between a non- transgenic plant of a 

10 plant line and a transgenic plant of the same plant line. 
Levels of VLCFAs having 20-32 carbons and/or levels of 
VLCFAs having 32-60 carbons can be altered in plants 
disclosed herein. Plants having an altered VLCFA 
composition may be identified by techniques known to the 

15 skilled artisan, e.g., thin layer chromatography or gas- 
liquid chromatography (GLC) analysis of the appropriate 
plant tissue. 

A suitable group of plants with which to practice 
the invention are the Brassica species, including B. 

20 napus, B. rapa, B.juncea, and B. hirta. Other suitable 
plants include, without limitation, soybean (Glycine 
max) , sunf lower (Helianthus annuus) and com (Zea mays) . 

A method according to the invention comprises 
introducing a nucleic acid construct into a plant cell 

25 and producing a plant (as well as progeny of such a 
plant) from the transformed cell. Progeny includes 
descendants of a particular plant or plant line, e.g., 
seeds developed on an instant plant are descendants. 
Progeny of an instant plant include seeds formed on F lf 

3 0 F 2 , F 3 , and subsequent generation plants, or seeds formed 
on BC 1# BC 2 , BC 3 , and subsequent generation plants. 

Methods and compositions according to the 
invention are useful in that the resulting plants and 
plant lines have desirable alterations in very long chain 

35 fatty acid composition. Suitable tissues in which to 
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express polynucleotides and/or polypeptides of the 
invention include, without limitation, seeds, stems and 
leaves. Leaf tissues of interest include cells and 
tissues of the epidermis, e.g., cells that are involved 
5 in forming trichomes. Of particular interest are 

epidermal cells involved in forming the cuticular layer. 
The cuticular layer comprises various very long chain 
fatty acids and VLCFA derivatives such as alkanes, 
esters, alcohols and aldehydes. Altering the composition 

10 and amount of VLCFAs in epidermal cells and tissues may 
enhance defense mechanisms and drought tolerance of 
plants disclosed herein. 

Polynucleotides of the invention can be used as 
markers in plant genetic mapping and plant breeding 

15 programs. Such markers may include RFLP, RAPD, or PCR 
markers, for example. Marker-assisted breeding 
techniques may be used to identify and follow a desired 
fatty acid composition during the breeding process. 
Marker-assisted breeding techniques may be used in 

20 addition to, or as an alternative to, other sorts of 

identification techniques. An example of marker-assisted 
breeding is the use of PCR primers that specifically 
amplify a sequence from a desired KAS that has been 
introduced into a plant line and is being crossed into 

25 other plant lines. 

Plants and plant lines disclosed herein preferably 
have superior agronomic properties. Superior agronomic 
characteristics include, for example, increased seed 
germination percentage, increased seedling vigor, 

3 0 increased resistance to seedling fungal diseases (damping 
off, root rot and the like) , increased yield, and 
improved standability . 

While the invention is susceptible to various 
modifications and alternative forms, certain specific 

35 embodiments thereof are described in the general methods 
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and examples set forth below. It should be understood, 
however, that these examples are not intended to limit 
the invention to the particular forms disclosed but, 
instead the invention is to cover all modifications, 
5 equivalents and alternatives falling within the scope of 
the invention. 

EXAMPLES 
Example 1 

Cloning and Expression of FAE1 in Yeast Cells 

10 The open reading frame of the Arabidopsis FAE1 

gene was amplified directly by PCR, using Arabidopsis 
thaliana cv. Columbia genomic DNA as a template, pfu DNA 
polymerase and the following primers: 
5 ' CTCGAGGAGCAATGACGTCCGTTAA- 3 ' and 5'- 

15 CTCGAGTTAGGACCGACCGTTTTG-3 ' . The PCR product was blunt - 
end cloned into the Eco RV site of pBluescript 
(Stratagene, La Jolla, CA) , 

The FAE1 gene was excised from the Bluescript 
vector with BamHl, and then subcloned into the pYEUra3 

2 0 (Clontech, Palo Alto, CA) . pYEUra3 is a yeast 

centromere-containing, episomal plasmid that is 
propagated stably through cell division- The FAE1 gene 
was inserted downstream of a GAL1 promoter in pYEUra3 . 
The GAL1 promoter is induced when galactose is present in 
25 the medium and repressed when glucose is present in the 
growth medium. 

Insertion of the FAE1 gene in the sense 
orientation was confirmed by PCR, and pYEUr a3 / FAE1 was 
used to transform Saccharomyces cerevisiae strain AB1380 

3 0 using a lithium acetate procedure as described in Gietz, 

R. and Woods, R. , in Molecular Genetics of Yeast: 
Practical Approaches, Oxford Press, pp. 121-134 (1994) . 
Plasmid DNA was isolated from putative transformants, and 
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the presence of the FAE1 / p YEUr a 3 construct was confirmed 
by Southern analysis. 

Yeast transformed with pYEUra3 having FAE1 
operably linked to the GAL1 promoter were grown in the 
5 presence of galactose or glucose and were analyzed for 
the expression of FAE1 , As a control, yeast transformed 
with pYEUra3 containing no insert were also assayed. 
Analysis of such control preparations yielded fatty acid 
compositions and fatty acid elongation rates similar to 
10 those of yeast transformed with p YEUr a 3 / FAE1 and grown 
with glucose as the carbon source. 

The fatty acid composition of yeast cells grown in 
the presence of galactose was compared to that of cells 
grown in the presence of glucose, to determine if VLCFA 
15 were found in the galactose -induced cells. 

Transformed yeast cells were grown overnight in 
YPD media at 30°C with vigorous shaking. One hundred /il 
of the overnight culture were used to inoculate 40 ml of 
complete minimal uracil dropout media (CM-Ura) 

2 0 supplemented with either glucose or galactose (2% w/v) . 

Cultures were grown at 30°C to an OD 600 of approximately 
1,3 to 1.5. Cells were harvested by centrifugation at 
5000 Xg for 10 min. Total lipids were extracted from the 
cells with 2 volumes of 4N KOH in 100% methanol for 60 
25 min, at 80°C. Fatty acids were saponified and methyl 
esters were prepared by drying the samples and 
resuspending in 0 . 5 ml of boron trichloride in methanol 
(10% v/v) . Samples were incubated at 50 °C for 15 min in 
a sealed tube. About 2 ml of water was then added and 

3 0 the fatty methyl esters were extracted thrice with 1 ml 

of hexane. Extracts were dried under nitrogen and 
redissolved in hexane. See Hlousek-Radojcic, A. et al., 
Plant J. 8:803-809. Methyl esters were analyzed on an HP 
5890 series II gas chromatograph equipped with a 5771MSD 
35 and 7673 auto injector (Hewlett-Packard, Cincinnati/ OH) . 
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Methyl esters were separated on a DB-23 (j&w Scientific) 
capillary column (30 m X 0.25 mm X 0.25 /im) . The column 
was operated with helium carrier gas and splitless 
injection (injection temperature 280°C, detector 
5 temperature 280°C) . After an initial 3 min. at 100°C, 
the oven temperature was raised to 250° at 20°C min" 1 and 
maintained at that temperature for an additional 3 min. 
The identity of the peaks was verified by 
cochromatography with authentic standards and by mass 

10 spectrometer analysis. 

The results clearly revealed the appearance of 
both 20:1 and 22:1 acyl-CoA products in galactose- induced 
yeast containing the FAE1 coding sequence. Uninduced 
yeast cells failed to accumulated significant amounts of 

15 fatty acids longer than C18. These results indicate that 
expression of FAE1 in yeast resulted in functional KAS 
activity and functional elongase activity. 

Example 2 
FAE1 Activity in Yeast Microsomes 

20 The functional expression of the FAJE1 KAS was 

analyzed by isolating microsomes from transformed yeast 
cells and assaying these microsomes in vitro for elongase 
activity. 

Transformed yeast cells were grown in the presence 
25 of either glucose or galactose (2% w/v) as described in 
Example 1. Cells were harvested by centrif ugation at 
5000 Xg for 10 min and washed with 10 ml ice cold 
isolation buffer (IB) , which contains 80 mM Hepes-KOH, pH 
7.2, 5 mM EGTA, 5 mM EDTA, 10 mM KC1 , 320 mM sucrose and 
30 2 mM DTT) . Cells were then resuspended in enough IB to 
fill a 1.7 ml tube containing 700 /xl of 0.5 /im glass 
beads and yeast microsomes were isolated from the cells 
essentially as described in Tillman, T* and Bell, R. , J. 
Biol. Chem. 261:9144-9149 (1986). The microsomal 
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membrane pellet was recovered by centrif ugation at 
252,000 xg for 60 min. The pellet was rinsed by 
resuspending in 40 ml fresh IB and again recovered by 
centrif ugation at 252000 Xg for 60 min. Microsomal 
5 pellets were resuspended in a minimal volume of IB, and 
the protein concentration adjusted to 2.5 jig /xl* 1 by 
addition of IB containing 15% glycerol. Microsomes were 
frozen on dry ice and stored at -80°C. The protein 
concentration in microsomes was determined by the 

10 Bradford method (Bradford, 1976) . 

Fatty acid elongase activity was measured 
essentially as described in Hlousek-Radojcic, A. et al . , 
Plant J . 8:803-809 (1995). Briefly, the standard 
elongation reaction mix contained 8 0 mM Hepes-KOH, pH 

15 7.2, 20 mM MgCl 2 , 500 //M NADPH, 1 mM ATP, 100 uM malonyl- 
CoA, 10 fM CoA-SH and 15 jzM radioactive acyl-CoA 
substrate. The radiolabeled substrate was either [1" 
14 C]18:l-CoA (50 uCi /zmol" 1 ) , [1' 14 C] 18 : 0-CoA (55 uCi /imol" 
1 ) , or [1" 14 C] 16 :0-CoA (54 uCi /xmol" 1 ) . The reaction was 

20 initiated by the addition of yeast microsomes (5 /xg 
protein) and the mixture incubated at 30° C for the 
indicated period of time. The final reaction volume was 

25 Mi- 
Methyl esters of the acyl-CoA elongation products 

25 were prepared as described in Example 1. Methyl esters 
were separated on reversed phase silica gel KC18 TLC 
plates (Whatman, 250 uM thick) , quantified by 
phosphor imaging, and analyzed on by ImageQuant software 
(Molecular Dynamics, Inc., Sunnyvale, CA) . The detection 

30 ^imit for each product is about 0.001 nanomoles per min. 
per mg microsomal protein, depending on the phosphorimage 
exposure time. 

Results of representative in vitro elongation 
assays are shown in Figs. 1 and 2. The results indicate 

35 that microsomes from galactose -induced cells expressing 
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FAE1 catalyzed multiple cycles of elongation starting 
with either C16:0 acyl CoA, C18:0 acyl CoA, or C18:l 
acyl-CoA as the substrate (Fig. 1). The 16:0 and 18:0 
acyl-CoA substrates were elongated to C26:0 acyl-CoA. In 
5 contrast, the 18:l-CoA substrate was elongated primarily 
to C20:l, with only low levels of C22:l acyl -CoA being 
produced. Occasionally, trace levels of C24:l CoA were 
also observed. Although the chain length of the products 
from the 18:1 acyl -CoA substrate were less than the chain 

10 length from the saturated acyl -CoA substrates, the rate 
of elongation of oleoyl-CoA was about 2- and 3 -fold 
higher than the rates of elongation of 16:0-CoA and 18:0- 
CoA, respectively. 

The elongation activity observed in microsomes 

15 from uninduced cells indicated that there was a low level 
of endogenous elongase activity when 18:l-CoA or 18:0-CoA 
were used as substrates. There was substantial 16:0-CoA 
elongase activity (10.1 nmol mg protein" 1 at 30 min) in 
microsomes from uninduced cells (Fig. 2) . However, the 

20 major product of 16:0 elongation using uninduced 

microsomes was C18:0 acyl CoA, with only small amounts of 
products beyond this length. The elongation of the 16:0 
acyl -CoA substrate presumably is due to an endogenous 
yeast elongase. 

25 Elongation of 18:1 CoA by microsomes from induced 

cells occurred at a rate about 18 -fold higher than in 
microsomes isolated from the uninduced cells (Fig. 2) . 
With microsomes from induced yeast, synthesis of 20:0 CoA 
from the 16:0 CoA substrate, occurred at a rate similar 

30 to that seen when the substrate was 18:0 CoA (4.2 vs. 5.1 
nmol mg protein" 1 ) . The total rate of elongation of [ 14 C] 
16:0-CoA by microsomes from induced cells (15.8 nmol mg 
protein" 1 at 30 min.) was more than 50% higher than 
elongation of [ 14 C] 16:0-CoA by microsomes from uninduced 

35 cells, suggesting that the FAE1 KAS utilized 16:0 -CoA as 
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a substrate in addition to C18-C24 acyl-CoAs. The FAE1 
elongase KAS activity, i.e., the difference in the 16:0 
elongation rates between microsomes from induced and 
uninduced cells, was 5.7 nmol mg protein" 1 . The 
5 elongation rate with the 16:0 substrate thus was similar 
to the elongase activity of the FAE1 elongase KAS with 
the 18:0 substrate. 

These results indicate that FAE1 KAS expressed in 
yeast could synthesize 3-ketoacyl-CoA in vitro and, in 
10 combination with yeast reductases and dehydrases, could 
form a functional VLCFA elongase complex. In addition, 
these results suggest that FAE1 is membrane -bound in 
yeast cells. 

Example 3 

15 Cloning and Sequencing of Arabldopsis Elongase Genes 

The sequence of a jojoba seed cDNA (see WO 
93/10241 and WO 95/15387, incorporated herein by 
reference) was used to search the Arabidopsis expressed 
sequence tag (EST) database of the Arabidopsis Genome 

20 Stock Center (The Ohio State University, Columbus, Ohio) . 
The BLAST computer program (National Institutes of 
Health, Bethesda, MD, USA) was used to perform the 
search. The search identified two ESTs (ATTS1282 and 
ATTS3218) that had a high degree of sequence identity 

25 with the jojoba sequence. The ATTS1282 and ATTS3218 ESTs 
appeared to be partial cDNA clones rather than full- 
length clones based on the length of the jojoba sequence. 

A genomic DNA library from Arabidopsis thaliana 
cv. Columbia, was prepared in the lambda GEM11 vector 

30 (Promega, Madison, Wisconsin) and was obtained from Ron 
Davis, Stanford University, Stanford, CA. The library 
was hybridized with ATTS1282 and ATTS3218 as probes and 2 
clones were identified for each EST. Phage DNA was 
isolated from each of the hybridizing clones, the genomic 
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insert was excised with the restriction enzyme Sac I and 
subcloned into the plasmid pBluescript (Stratagene, La 
Jolla, CA) . One clone from the ATTS1282 hybridization 
was designated ELI and one clone from the ATTS3218 
5 hybridization was designated EL2 . 

A yeast expression library, containing cDNA from 
Arabidopsis thaliana cv, Columbia, was prepared in the 
lambda YES expression vector described in Elledge et al . 
(Elledge*, S. et al., Proc. Natl. Acad. Sci USA 88:1731- 

10 1735 (1991) and was obtained from Ron Davis at Stanford 
University, Stanford, CA. The library was hybridized 
with a EL2 partial cDNA probe. A full-length EL2 cDNA 
was not identified. However, the probe did identify a 
full-length cDNA which was designated EL3 . 

15 A consensus sequence for the C- terminal region of 

ELI, EL2 and the jojoba cDNA polypeptides was identified 
by sequence alignment using DNA analysis programs from 
DNAStar, Madison, Wisconsin. This consensus sequence was 
used to search the Arabidopsis EST database again for fi- 

20 keto acyl synthase sequences. These searches identified 
four additional putative S-keto acyl synthase ESTs, which 
were designated EL4 through EL7 . EL4, EL5, EL6, and EL 7 
have homology to Genbank Accession Nos. T04345, T44939, 
T22193 and T76700, respectively. 

25 The lambda YES cDNA expression library described 

above was hybridized with the ELI and EL4-EL7 ESTs as 
probes. This screen identified full-length cDNAs for 
ELI, ELS and EL6 . 

The lambda GEM11 genomic library was hybridized 

30 with the EL4 and EL7 ESTs as probes. This screen 

identified full-length genomic clones for EL4 and EL7. 
Phage DNA was isolated from each of the hybridizing 
clones and subcloned into pBluescript as described above. 
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The 7 EL clones were sequenced on both strands 
with regions of overlap for each sequence run. 
Sequencing was carried out with an ABI automated 
sequencer (Applied Biosystems, Inc., Foster City, 
5 California), following the manufacturer's instructions* 

The nucleotide sequences for the coding regions of 
EL1-EL7 are shown in Figs. 3, 5, 7, 9, 11, 13 and 15, 
respectively. The deduced amino acid sequences for EL1- 
EL7 are shown in Figs. 4, 6, 8, 10, 12, 14 and 16, 

10 respectively, using the standard one-letter amino acid 
code. The ELI , EL2 and EL7 genomic clones appeared to 
lack introns. The EL4 genomic clone contained one intron 
near the 5' end of the coding region. 

The nucleotide sequences of the 7 EL 

15 polynucleotides were compared to 5 DNA sequences present 
in Genbank. Genbank, National Center for Biotechnology 
Information, Bethesda, MD. Two of the 5 accessions were 
cloned from members of the Brassicaceae : the Arabidopsis 
FAE1 sequence (Accession U29142) and a Brassica napus 

20 sequence (Accession U50771) . Three of the accessions 
were cloned from jojoba {Simmondsia chinensis) : 2 wax 
biosynthesis genes (Accessions 114084 and 114085) and a 
jojoba KAS gene (Accession U37088) . See also U.S. Patent 
5,445,947, incorporated herein by reference. 

25 Multiple alignment of the 12 sequences was carried 

out with a computer program sold under the trade name 
MEGALIGN Lasergene by DNAStar (Madison, Wisconsin) . 
Alignments were done using the Clustal method with 
weighted residue weight table. The nucleotide sequence 

30 similarity index and percent divergence based on the 
multiple alignment algorithm is shown in Table 1. The 
nucleotide sequences of EL1-EL7 are distinguishable from 
the 5 DNA sequences obtained from Genbank. 

The deduced amino acid sequences of the ELI -7 

3 5 polypeptides were compared with the MEGALIGN program to 
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the deduced amino acid sequences of the same 5 Genbank 
clones, using the Clustal method with PAM2 50 residue 
weight table. The amino acid sequence similarity and 
percent divergence are shown in Table 2 . The amino acid 
5 sequences of ELI -EL 7 polypeptides are distinguishable 
from those of the Genbank sequences. 
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Example 4 
Expression of ELI and EL2 in Yeast 

The open reading frames (ORFs) for the EL2, EL4 
and EL7 clones were amplified by PCR. The EL2 ORF was 
cloned into XYES using the primers: 

CTCGAGCAAGTCCACTACCACGCA and CTCGAGCGAGTCAGAAGGAACAAA . 
The EL4 ORF was cloned into pYEUra3 using the primers: 
GATAATTTAGAGAGGCACAGGGT and GTCGACACAAGAATGGGTAGATCCAA . 
The EL7 ORF was cloned into pYEUra3 using the primers: 
CAGTTCCTCAAACGAAGCTA and GTCGACTTCTCAATGGACGGTGCCGGA . 
Amplified products were cloned into pYEUra3 under the 
control of, and 3' to, the GAL1 promoter • The resulting 
plasmids were transformed into yeast as described in 
Example 1 . 

Yeast cultures containing full-length ELI in XYES 
and full-length EL2 in pYEUra3 were grown in the presence 
of galactose or glucose as described in Example 2 . 
Microsomes were then prepared from each of the cultures 
and fatty acid elongation assays were carried out as 
described in Example 2. 

In the first experiment, microsomes were prepared 
from galactose -induced cultures of ELI, EL2 and FAE1, and 
incubated with either [1- 14 C] 18:0 acyl-CoA or [1- 14 C] 18:1 
acyl-CoA as substrate. The amounts of various reaction 
products synthesized after 3 0 minutes (min) were 
determined as described in Example 2. The results when 
18:0 acyl-CoA was the substrate are shown in Table 3. 
The results when 18:1 acyl-CoA was the substrate are 
shown in Table 4 . 
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table 3. 

Elongation of 18»0-CoA by FAH1, ELI and EL2 Genes 
Expressed in Yeast 







£-Keto Acyl Synthase Gene 




FAE1 


ELI 


EL2 


Acyl-CoA 
Product 


Rate 1 




Rate 




Kate 


1%) 


20:0 


0.369 


64.3 


0.084 


38.8 


0.108 


41.8 


22:0 


0.113 


18.6 


0.047 


21.9 


0.053 


20.7 


24:0 


0.065 


10.7 


0.043 


19.9 


0.052 


20.3 


26:0 


0.038 


6.3 


0.042 


19.4 


0.044 


17.2 


Total 


0.585 


100.0 


0.216 


100.0 


0.258 


100.0 



1 Hanonoles /minute /wg of microsomal protein 



Table 4. 

Klongation of I81I-C0A by 7AB1, XL1 and SL2 Genes 
Expressed in Yeast 





S-Keto Acyl Synthase Gene 




FAE1 H ELI 


EL2 


| Acyl-CoA 
1 Product 


Rate 1 


(%) 


Rate 


(%) 


Rate 




H 20:1 


1.131 


64.6 


0.111 


80.8 


0.091 


84.1 


22:1 


0.206 


15.4 


0.026 


19.2 


0.017 


15.9 


24:1 1 0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


26:1 H 0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


Total l| 1.337 


100.0 


0.137 


100.0 


1 0.108 


100.0 



1 Nanomolea/mimjce/mg ot microsomal protein 



The results shown in Tables 3 and 4 indicate that 
the ELI and EL2 gene products have S-ketoacyl synthase 
(KAS) activity and that the KAS reaction product can be 
utilized to form VLCFAs. The specific activities of the 
3 KAS enzymes cannot be compared, since the relative 
amount of the heterologous KAS protein in each microsomal 
preparation is not known. However, the proportions of 
various reaction products can be compared between FAE1, 
ELI and EL2 . 

The data shown in Table 3 indicate that the ELI 
and EL2 KAS activities result in a higher proportion of 
saturated VLCFAs than does the FAE1 KAS activity. These 
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results suggest that ELI and EL2 encode novel gene 
products, because ELI and EL2 have a greater preference 
for C22:0 and C24:0 acyl-CoA substrates than does FAE1. 

A comparison of the relative elongation activity 
of FAE1 with 18:0 and 18:1 substrates (Tables 3 and 4) 
indicates that FAE1 is more active when 18:1 is the 
substrate than when 18:0 is the substrate. In contrast, 
the overall rate of product formation with ELI is less 
when 18:1 is the substrate than when 18:0 is the 
substrate (Tables 3 and 4) . EL2 is also less active when 
18:1 is the substrate than when 18:0 is the substrate 
(Tables 3 and 4) . These results support the conclusion 
that ELI and EL2 encode novel gene products and suggest 
that ELI and EL2 have a preference for saturated fatty 
acids as substrates, whereas the FAE1 gene product has a 
preference for monounsaturated fatty acids as substrates. 

In a second experiment, microsomes were prepared 
from galactose -induced and from glucose -repressed yeast 
cultures containing ELI or EL2 coding sequences. The 
microsomal preparations were incubated with either 18:0 
acyl-CoA or 18:1 acyl-CoA as substrate and the fatty acid 
reaction products determined as described above. The 
results with the 18:0 substrate are shown in Table 5. 
The results with the 18:1 substrate are shown in Table 6. 
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Table 5. 

Elongation of 18:0-CoA by ELI and EL2 
With and Without Induction of Gene Expression 





S-Keto Acyl 


mthaae Gene 


Acyl 


E 


LI 


EL2 


♦Glu 


coae 


♦Galactose 


♦Glucose 


♦Galactose 


CoA 


Rate 1 


(»> 


Rate 


(%> 


Rate 


(%) 


Rate 


it) 


20:0 


0.007 


100. 0 


0.074 


55-8 


0.030 


81.3 


0.107 


43.1 


22:0 


0.000 


o.o 


0.023 


17.4 


0.002 


5.1 


0.044 


X7.8 


24:0 


0.000 


0.0 


0.030 


15 .,3 


0.005 


13.6 


0.048 


19.1 


26i0 


0.000 


0 0 


0.015 


11. S 


0.000 


0.0 


0.050 


20 .0 


Total 

i. . i__ /_ j _ / 


0.007 


100.0 


0.133 


100.0 


0.037 


100.0 


0.249 


100.0 



1 Nanomoles/minute/ng of microsomal protein 



Table 6. 

Elongation of 18:l-CoA by ELI and EL2 
With and Without Induction of Gene Expression 



Acyl 
CoA 



S-Keto Acyl Synthase Gene 



ELI 



♦Glucoae 



Rate 1 



♦Galactose 



EL2 



Rate 



(%) 



♦Glucose 



(%) 



♦Galactose 



20 tl 



0.081 



0.Qs9 



100-0 



o.ooo 



o.o 



Total 



100.0 



Nanoa»les/minute/mg ot microsomal protein 



The results in Table 5 show in vitro elongase 
activity for ELI and EL2 under induced (galactose) and 
uninduced (glucose) conditions. The comparison indicates 
that induction with galactose results in a large increase 
in overall elongase activity when 18:0 acyl CoA is the 
substrate (about 19 -fold and 7-fold for ELI and EL2, 
respectively) . In contrast, induction when 18:1 acyl CoA 
is the substrate results in only a small increase in 
elongase activity (about 1.3 -fold and 2- fold for ELI and 
E12, respectively), as shown in Table 6. 

The results in Table 5 show that little or no 
VLCFA products are made by yeast microsomes under 
uninduced conditions. Upon induction of ELI and EL2 gene 
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expression, however, significant quantities of C20:0 # 
C22:0, C24:0 and C26:0 are made. The data in Tables 5 
and 6 are consistent with the results in Tables 3 and 4, 
which indicated that ELI and EL2 were more active with a 
saturated fatty acid substrate than with a 
monounsaturated substrate. 

The data in Tables 5 and 6 are also consistent 
with the data in Tables 3 and 4 indicating that the ELI 
and EL2 gene products are more active in converting C24:0 
to C26:0 than is FAE1. 

In a third experiment, microsomes from induced and 
uninduced cultures containing ELI or EL2 were incubated 
in the absence of cofactors involved in the fi-ketoacyl 
condensation reaction. Cultures were induced and 
microsomes were prepared as described in Example 2 . In 
vitro assays were carried out as described in Example 2, 
except that either ATP, CoASH or both were omitted from 
the enzyme reaction mixture. In addition, one reaction 
was carried out in a complete mixture having 0.01 mM of 
cerulenin (Sigma, St. Louis, MO). Cerulenin is an 
inhibitor of some condensing enzymes. The results are 
shown in Tables 7-9. 
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Table 7. 

Effect of Cof actors on 18:0-CoA Elongation 1 



Gene 


Expt 4 


+Glu 2 


+Gal J 


-ATP 3 


-CoA 3 


-A&C 3 


+ Cer 3 


ELI 


1 


.037 


.109 


.095 


.105 


.119 


.141 




2 


N.D. 


.090 


.125 


.093 


.270 


.176 


EL2 


1 


.033 


.112 


. 168 


.127 


.143 


.238 




2 


N.D. 


.120 


.178 


.133 


.195 


.302 



Activity in nanomoles /minute /mg of microsomal protein. 

2 +Glu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal: microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose-induced cultures. -ATP : ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 

4 Experiment No. 



WO 98/54954 



PCT/US98/ 11384 



- 34 - 
Table 8. 



Effect of Cofactors on Elongation Products of ELI 1 



Prod. 


+Glu 3 


+Gal 2 


-ATP 3 


-CoA 3 


-A&C 3 


+Cer 3 


20:0 


53.9 


46.2 


34.4 


47.8 


41.7 


46.7 


22 :0 


14 .4 


18.7 


13.7 


18.0 


19.4 


16.2 


24 :0 


18.5 


18.1 


20.6 


19.1 


16.7 


17.7 


26:0 


13.2 


17.1 


31 :4 


15.2 


22.3 


19.4 


Total 


100.0 


100.0 


100 .0 


100.0 


100.0 


100.0 



1 Amount of indicated product as a percent of total products formed. 
Results of one experiment for +Glucose; Average of two experiments for 
other conditions. 

2 +Glu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal: microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose- induced cultures. -ATP: ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 
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Table 9. 

Effect of Cof actors on Elongation Products of EL2 1 



Prod. 


+Glu J 


+Gal 2 


-ATP 3 


-CoA 3 


-A&C 3 


+Cer 3 


20:0 


54.5 


47. 1 


34.1 


45.3 


38.0 


41.8 


22:0 


17.1 


19.1 


16.4 


19.2 


15.9 


16.1 


24:0 


5.8 


19.4 


20.8 


19.9 


18.4 


20.4 


26:0 


22.6 


14 .5 


28.9 


15.8 


27.8 


21.8 


Total 


100.0 


100.0 


100.0 


100.0 


100 . 0 


100.0 



Amount of indicated product as a percent of total products formed. 
Results of one experiment for +Glucose; Average of two experiments for 
other conditions. 

2 +Glu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal : microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose- induced cultures. -ATP: ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 

The results in Table 7 indicate that omission of 
ATP and/or CoA from the incubation mixture does not have 
a significant effect on the overall amounts of VLCFAs 
synthesized by the in vitro KAS activity of ELI or EL2 . 
The results also show that cerulenin does not inhibit the 
KAS activity of ELI or EL2 . The data in Table 8 and 9 
confirm that ELI and EL2 KAS activity produces 
significant amounts of C24:0 and C26:0 acyl CoA products. 

To the extent not already indicated, it will be 
understood by those of ordinary skill in the art that any 
one of the various specific embodiments herein described 
and illustrated may be further modified to incorporate 
features shown in other of the specific embodiments. 

The foregoing detailed description has been 
provided for a better understanding of the invention only 
and no unnecessary limitation should be understood 
therefrom as some modifications will be apparent to those 
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skilled in the art without deviating from the spirit and 
scope of the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: CARGILL, INCORPORATED 
(ii) TITLE OF THE INVENTION: FATTY ACID ELONGASES 



(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C., P. A. 

(B) STREET: 60 South Sixth Street, Suite 3300 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY : USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/868,373 

(B) FILING DATE: 03-JUN-1997 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Lundquist, Ronald C 

(B) REGISTRATION NUMBER: 37,875 

(C) REFERENCE/DOCKET NUMBER: 07039/064WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-335-5050 

(B) TELEFAX: 612-288-9696 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATGGATCGAG AGAGATTAAC GGCGGAGATG GCGTTTCGAG ATTCATCATC GGCCGTTATA 60 

AGAATTCGAA GACGTTTGCC GGATTTATTA ACGTCCGTTA AGCTCAAATA CGTGAAGCTT 120 

GGACTTCACA ACTCTTGCAA CGTGACCACC ATTCTCTTCT TCTTAATTAT TCTTCCTTTA 180 

ACCGGAACCG TGCTGGTTCA GCTAACCGGT CTAACGTTCG ATACGTTCTC TGAGCTTTGG 240 

TCTAACCAGG CGGTTCAACT CGACACGGCG ACGAGACTTA CCTGCTTGGT TTTCCTCTCC 300 

TTCGTTTTGA CCCTCTACGT GGCTAACCGG TCTAAACCGG TTTACCTAGT GGATTTCTCC 360 

TGCTACAAAC CGGAAGACGA GCGTAAAATA TCAGTAGATT CGTTCTTGAC GATGACTGAG 420 

GAAAATGGAT CATTCACCGA TGACACGGTT CAGTTCCAGC AAAGAATCTC GAACCGGGCC 480 
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GGTTTGGGAG ACGAGACGTA TCTGCCACGT GGCATAACTT CAACGCCCCC GAAGCTAAAT 54 0 

ATGTCAGAGG CACGTGCCGA AGCTGAAGCC GTTATGTTTG GAGCCTTAGA TTCCCTCTTC 600 

GAGAAAACCG GAATTAAACC GGCCGAAGTC GGAATCTTGA TAGTAAACTG CAGCTTATTC 660 

AATCCGACGC CGTCTCTATC AGCGATGATC GTGAACCATT ACAAGATGAG AGAAGACATC 72 0 

AAAAGTTACA ACCTCGGAGG AATGGGTTGC TCCGCCGGAT TAATCTCAAT CGATCTCGCT 780 

AACAATCTCC TCAAAGCAAA CCCTAATTCT TACGCTGTCG TGGTAAGCAC GGAAAACATA 840 

ACCCTAAACT GGTACTTCGG AAATGACCGG TCAATGCTCC TCTGCAACTG CATCTTCCGA 900 

ATGGGCGGAG CTGCGATTCT CCTCTCTAAC CGCCGTCAAG ACCGGAAGAA GTCAAAGTAC 960 

TCGCTGGTCA ACGTCGTTCG AACACATAAA GGATCAGACG ACAAGAACTA CAATTGCGTG 1020 

TACCAGAAGG AAGACGAGAG AGGAACAATC GGTGTCTCTT TAGCTAGAGA GCTCATGTCT 1080 

GTCGCCGGAG ACGCTCTGAA AACAAACATC ACGACTTTAG GACCGATGGT TCTTCCATTG 1140 

TCAGAGCAGT TGATGTTCTT GATTTCCTTG GTCAAAAGGA AGATGTTCAA GTTAAAAGTT 1200 

AAACCGTATA TTCCGGATTT CAAGCTAGCT TTCGAGCATT TCTGTATTCA CGCAGGAGGT 1260 

AGAGCGGTTC TAGACGAAGT GCAGAAGAAT CTTGATCTCA AAGATTGGCA CATGGAACCT 1320 

TCTAGAATGA CTTTGCACAG ATTTGGTAAC ACTTCGAGTA GCTCGCTTTG GTATGAGATG 1380 

GCTTATACCG AAGCTAAGGG TCGGGTTAAA GCTGGTGACC GACTTTGGCA GATTGCGTTT 1440 

GGATCGGGTT TCAAGTGTAA TAGTGCGGTT TGGAAAGCGT TACGACCGGT TTCGACGGAG 1500 

GAGATGACCG GTAATGCTTG GGCTGGTTCG ATTGATCAAT ATCCGGTTAA AGTTGTGCAA 1560 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Asp Arg Glu Arg Leu Thr Ala Glu Met Ala Phe Arg Asp Ser Ser 

15 10 15 

Ser Ala Val lie Arg lie Arg Arg Arg Leu Pro Asp Leu Leu Thr Ser 

20 25 30 

Val Lys Leu Lys Tyr Val Lys Leu Gly Leu His Asn Ser Cys Asn Val 

35 40 45 

Thr Thr He Leu Phe Phe Leu He He Leu Pro Leu Thr Gly Thr Val 

50 55 60 

Leu Val Gin Leu Thr Gly Leu Thr Phe Asp Thr Phe Ser Glu Leu Trp 
65 70 75 80 

Ser Asn Gin Ala Val Gin Leu Asp Thr Ala Thr Arg Leu Thr Cys Leu 

85 90 95 

Val Phe Leu Ser Phe Val Leu Thr Leu Tyr Val Ala Asn Arg Ser Lys 

100 105 110 

Pro Val Tyr Leu Val Asp Phe Ser Cys Tyr Lys Pro Glu Asp Glu Arg 

115 120 125 

Lys He Ser Val Asp Ser Phe Leu Thr Met Thr Glu Glu Asn Gly Ser 

130 135 140 

Phe Thr Asp Asp Thr Val Gin Phe Gin Gin Arg He Ser Asn Arg Ala 
145 150 155 160 

Gly Leu Gly Asp Glu Thr Tyr Leu Pro Arg Gly He Thr Ser Thr Pro 

165 170 175 

Pro Lys Leu Asn Met Ser Glu Ala Arg Ala Glu Ala Glu Ala Val Met 

180 185 190 

Phe Gly Ala Leu Asp Ser Leu Phe Glu Lys Thr Gly He Lys Pro Ala 

195 200 205 

Glu Val Gly He Leu He Val Asn Cys Ser Leu Phe Asn Pro Thr Pro 

210 215 220 

Ser Leu Ser Ala Met He Val Asn His Tyr Lys Met Arg Glu Asp He 
225 230 235 240 

Lys Ser Tyr Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Leu He Ser 

245 250 255 

He Asp Leu Ala Asn Asn Leu Leu Lys Ala Asn Pro Asn Ser Tyr Ala 

260 265 270 

Val Val Val Ser Thr Glu Asn He Thr Leu Asn Trp Tyr Phe Gly Asn 
275 280 285 
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Asp Arg Ser 


Met 


Leu 


Leu 


Cys 


Asn 


Cys He 


Phe Arg 


Met 


Glv 


Gly 


Ala 


290 








295 






300 










Ala lie Leu 


Leu 


Ser 


Asn 


Arg 


Arq 


Gin Asp 


Arg Lys 


Lys 


Ser 


Lys 


Tyr 


305 






310 








315 








320 


Ser Leu Val 


Asn 


Val 


Val 


Arg 


Thr 


His Lys 


Gly Ser 


Asp 


Asp 


Lys 


Asn 






325 








330 








335 




Tyr Asn Cys 


Val 


Tyr 


Gin 


Lys 


Glu 


Asp Glu 


Arg Gly 


Thr 


He 


Gly 


Val 




340 










345 






350 






Ser Leu Ala 


Arg 


Glu 


Leu 


Met 


Ser 


Val Ala 


Gly Asp 


Ala 


Leu 


Lys 


Thr 


355 










360 






365 








Asn lie Thr 


Thr 


Leu 


Gly 


Pro 


Met 


Val Leu 


Pro Leu 


Ser 


Glu 


Gin 


Leu 


370 








375 






380 










Met Phe Leu 


He 


Ser 


Leu 


Val 


Ly9 


Arg Lys 


Met Phe 


Lys 


Leu 


Lys 


Val 


385 






390 








395 






400 


Lys Pro Tyr 


He 


Pro 


Asp 


Phe 


Lys 


Leu Ala 


Phe Glu 


His 


Phe 


Cys 


He 






405 








410 








415 




His Ala Gly 


Gly 


Arg 


Ala 


Val 


Leu 


Asp GlU'Val Gin 


Lys 


Asn 


Leu 


Asd 




420 










425 






430 






Leu Lys Asp 


Trp 


His 


Met 


Glu 


Pro 


Ser Arg 


Met Thr 


Leu 


His 


Arg 


Phe 


435 










440 






445 








Gly Asn Thr 


Ser 


Ser 


Ser 


Ser 


Leu 


Trp Tyr Glu Met 


Ala Tyr 


Thr 


Glu 


450 








455 






460 










Ala Lys Gly 


Arg 


Val 


Lys 


Ala 


Gly 


Asp Arg 


Leu Trp 


Gin 


lie 


Ala 


Phe 


465 






470 








475 








480 


Gly Ser Gly 


Phe 


Lys 


Cys 


Asn 


Ser 


Ala Val 


Trp Lys 


Ala 


Leu 


Arg 


Pro 






485 








490 








495 




Val Ser Thr 


Glu 


Glu 


Met 


Thr 


Gly 


Asn Ala Trp Ala 


Gly Ser 


He 


Asp 




500 










505 






510 






Gin Tyr Pro 


Val 


Lys 


Val 


Val 


Gin 















515 520 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGGATTACC CCATGAAGAA GGTAAAAATC TTTTTCAACT ACCTCATGGC GCATCGCTTC 60 

AAGCTCTGCT TCTTACCATT AATGGTTGCT ATAGCCGTGG AGGCGTCTCG TCTTTCCACA 120 

CAAGATCTCC AAAACTTTTA CCTCTACTTA CAAAACAACC ACACATCTCT AACCATGTTC 180 

TTCCTTTACC TCGCTCTCGG GTCGACTCTT TACCTCATGA CCCGGCCCAA ACCCGTTTAT 240 

CTCGTTGACT TTAGCTGCTA CCTCCCACCG TCGCATCTCA AAGCCAGCAC CCAGAGGATC 300 

ATGCAACACG TAAGGCTTGT ACGAGAAGCA GGCGCGTGGA AGCAAGAGTC CGATTACTTG 360 

ATGGACTTCT GCGAGAAGAT TCTAGAACGT TCCGGTCTAG GCCAAGAGAC GTACGTACCC 420 

GAAGGTCTTC AAACTTTGCC ACTACAACAG AATTTGGCTG TATCACGTAT AGAGACGGAG 480 

GAAGTTATTA TTGGTGCGGT CGATAATCTG TTTCGCAACA CGGGAATAAG CCCTAGTGAT 540 

ATAGGTATAT TGGTGGTGAA TTCAAGCACT TTTAATCCAA CACCTTCGCT ATCAAGTATC 600 

TTAGTGAATA AGTTTAAACT TAGGGATAAT ATAAAGAGCT TGAATCTTGG TGGGATGGGG 660 

TGTAGCGCTG GAGTCATCGC TATCGATGCG GCTAAGAGCT TGTTACAAGT TCATAGAAAC 720 

ACTTATGCTC TTGTGGTGAG CACGGAGAAC ATCACTCAAA ACTTGTACAT GGGTAACAAC 780 

AAATCAATGT TGGTTACAAA CTGTTTGTTC CGTATAGGTG GGGCCGCGAT TTTGCTTTCT 840 

AACCGGTCTA TAGATCGTAA ACGCGCAAAA TACGAGCTTG TTCACACCGT GCGGGTCCAT 900 

ACCGGAGCAG ATGACCGATC CTATGAATGT GCAACTCAAG AAGAGGATGA AGATGGCATA 960 

GTTGGGGTTT CCTTGTCAAA GAATCTACCA ATGGTAGCTG CAAGAACCCT AAAGATCAAT 1020 

ATCGCAACTT TGGGTCCGCT TGTTCTTCCC ATAAGCGAGA AGTTTCACTT CTTTGTGAGG 1080 

TTCGTTAAAA AGAAGTTTCT CAACCCCAAG CTAAAGCATT ACATTCCGGA TTTCAAGCTC 1140 

GCATTCGAGC ATTTCTGTAT CCATGCGGGT GGTAGAGCGC TAATTGATGA GATGGAGAAG 1200 

AATCTTCATC TAACTCCACT AGACGTTGAG GCTTCAAGAA TGACATTACA CAGGTTTGGT 1260 

AATACCTCTT CGAGCTCCAT TTGGTACGAG TTGGCTTACA CAGAAGCCAA AGGAAGGATG 132 0 

ACGAAAGGAG ATAGGATTTG GCAGATTGCG TTGGGGTCAG GTTTTAAGTG TAATAGTTCA 1380 
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GTTTGGGTGG CTCTTCGTAA CGTCAAGCCT TCTACTAATA ATCCTTGGGA ACAGTGTCTA 144 0 
CACAAATATC CAGTTGAGAT CGATATAGAT TTAAAAGAG 147 9 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 4: 

Met Asp Tyr Pro Met Lys Lys Val Lys lie Phe Phe Asn Tyr Leu Met 

1.5 10 -15 

Ala His Arg Phe Lys Leu Cys Phe Leu Pro Leu Met Val Ala lie Ala 

20 25 30 

Val Glu Ala Ser Arg Leu Ser Thr Gin Asp Leu Gin Asn Phe Tyr Leu 

35 40 45 

Tyr Leu Gin Asn Asn His Thr Ser Leu Thr Met Phe Phe Leu Tyr Leu 

50 55 60 

Ala Leu Gly Ser Thr Leu Tyr Leu Met Thr Arg Pro Lys Pro Val Tyr 
65 70 75 80 

Leu Val Asp Phe Ser Cys Tyr Leu Pro Pro Ser His Leu Lys Ala Ser 

85 90 95 

Thr Gin Arg He Met Gin His Val Arg Leu Val Arg Glu Ala Gly Ala 

100 105 110 

Trp Lys Gin Glu Ser Asp Tyr Leu Met Asp Phe Cys Glu Lys He Leu 

115 120 125 

Glu Arg Ser Gly Leu Gly Gin Glu Thr Tyr Val Pro Glu Gly Leu Gin 

130 135 140 

Thr Leu Pro Leu Gin Gin Asn Leu Ala Val Ser Arg He Glu Thr Glu 
145 150 155 160 

Glu Val He He Gly Ala Val Asp Asn Leu Phe Arg Asn Thr Gly He 

165 170 175 

Ser Pro Ser Asp He Gly He Leu Val Val Asn Ser Ser Thr Phe Asn 

180 185 190 

Pro Thr Pro Ser Leu Ser Ser He Leu Val Asn Lys Phe Lys Leu Arg 

195 200 205 

Asp Asn He Lys Ser Leu Asn Leu Gly Gly Met Gly Cys Ser Ala Gly 

210 215 220 

Val He Ala He Asp Ala Ala Lys Ser Leu Leu Gin Val His Arg Asn 
225 230 235 240 

Thr Tyr Ala Leu Val Val Ser Thr Glu Asn He Thr Gin Asn Leu Tyr 

245 250 255 

Met Gly Asn Asn Lys Ser Met Leu Val Thr Asn Cys Leu Phe Arg He 

260 265 270 

Gly Gly Ala Ala He Leu Leu Ser Asn Arg Ser He Asp Arg LyB Arg 

275 280 285 

Ala Lys Tyr Glu Leu Val His Thr Val Arg Val His Thr Gly Ala Asp 

290 295 300 

Asp Arg Ser Tyr Glu Cys Ala Thr Gin Glu Glu Asp Glu Asp Gly He 
305 310 315 320 

Val Gly Val Ser Leu Ser Lys Asn Leu Pro Met Val Ala Ala Arg Thr 

325 330 335 

Leu Lys He Asn He Ala Thr Leu Gly Pro Leu Val Leu Pro He Ser 

340 345 350 

Glu Lys Phe His Phe Phe Val Arg Phe Val Lys Lys Lys Phe Leu Asn 

355 360 365 

Pro Lys Leu Lys His Tyr He Pro Asp Phe Lys Leu Ala Phe Glu His 

370 375 380 

Phe Cys He His Ala Gly Gly Arg Ala Leu He Asp Glu Met Glu Lys 
385 390 395 400 

Asn Leu His Leu Thr Pro Leu Asp Val Glu Ala Ser Arg Met Thr Leu 
405 410 415 
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His 


Arg 


Phe 


Gly 
420 


Asn 


Thr Ser Ser 


Ser 
425 


Ser 


He 


Trp 


Tyr 


Glu Leu 
430 


Ala 


Tyr 


Thr 


Glu 


Ala 


Lys 


Gly Arg Met 


Thr 


Lys 


Gly Asp Arg He Trp Gin 






435 






440 










445 






lie 


Ala 
450 


Leu 


Gly 


Ser 


Gly Phe Lys 
455 


Cys 


Asn 


Ser 


Ser 
460 


Val 


Trp Val 


Ala 


Leu 


Arg 


Asn 


Val 


Lys 


Pro Ser Thr 


Asn 


Asn 


Pro 


Trp 


Glu 


Gin Cys 


Leu 


465 










470 






475 




480 


His 


Lys 


Tyr 


Pro 


Val 
485 


Glu lie Asp 


He 


Asp 
490 


Leu 


Lys 


Glu 







(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTACGTCAGG GTAGAACAAA GAGTAAACAC • TTAAGCAAAA CAATTTGTCC TACTCTTAGG 60 

TTATCTCCAA TGAAGAACTT AAAGATGGTT TTCTTCAAGA TCCTCTTTAT CTCTTTAATG 120 

GCAGGATTAG CCATGAAAGG ATCTAAGATC AACGTAGAAG ATCTCCAAAA GTTCTCCCTC 180 

CACCATACAC AGAACAACCT CCAAACCATA AGCCTTCTAT TGTTTCTTGT CGTTTTTGTG 240 

TGGATCCTCT ACATGTTAAC CCGACCTAAA CCCGTTTACC TTGTTGATTT CTCCTGCTAC 300 

CTTCCACCGT CGCATCTCAA GGTCAGTATC CAAACCCTAA TGGGACACGC AAGACGTGCA 360 

AGAGAAGCAG GCATGTGTTG GAAGAACAAA GAGAGCGACC ATTTAGTTGA CTTCCAGGAG 420 

AAGATTCTTG AACGTTCCGG TCTTGGTCAA GAAACCTACA TCCCCGAGGG TCTTCAGTGC 480 

TTCCCACTTC AGCAAGGCAT GGGTGCTTCA CGTAAAGAGA CGGAAGAAGT AATCTTCGGA 540 

GCTCTTGACA ATCTTTTTCG CAACACCGGT GTAAAACCTG ATGATATCGG TATATTGGTG 600 

GTGAATTCTA GCACGTTTAA TCCAACTCCA TCACTCGCCT CCATGATTGT GAACAAGTAC 660 

AAACTCAGAG ACAACATCAA GAGTTTGAAT CTTGGAGGGA TGGGTTGCAG TGCCGGAGTT 720 

ATAGCTGTTG ATGTCGCTAA GGGATTACTA CAAGTTCATA GGAACACTTA TGCTATTGTA 780 

GTAAGCACAG AGAACATCAC TCAGAACTTA TACTTGGGGA AAAACAAATC AATGCTAGTC 840 

ACAAACTGTT TGTTCCGCGT TGGTGGTGCT GCGGTTCTGC TTTCAAACAG ATCTAGAGAC 900 

CGTAACCGCG CCAAATACGA GCTTGTTCAC ACCGTACGGA TCCATACCGG ATCAGATGAT 960 

AGGTCGTTCG AATGTGCGAC ACAAGAAGAG GATGAAGATG GTATAATTGG AGTTACCTTG 1020 

ACAAAGAATC TACCTATGGT GGCTGCAAGG ACTCTTAAGA TAAATATCGC AACTTTGGGT 1080 

CCTCTTGTAC TTCCATTAAA AGAGAAGCTA GCCTTCTTTA TTACTTTTGT CAAGAAGAAG 1140 

TATTTCAAGC CAGAGTTAAG GAATTATACA CCAGATTTCA AGCTTGCCTT TGAGCATTTC 1200 

TGTATCCACG CTGGTGGAAG AGCTCTAATA GATGAGCTGG AGAAGAACCT TAAGCTTTCT 1260 

CCGTTACACG TAGAGGCGTC AAGAATGACA CTACACAGGT TTGGTAACAC TTCTTCTAGC 1320 

TCAATCTGGT ACGAGTTAGC TTATACAGAA GCTAAAGGAA GGATGAAGGA AGGAGATAGG 1380 

ATTTGGCAGA TTGCTTTGGG GTCAGGTTTT AAGTGTAACA GTTCAGTATG GGTGGCTCTG 1440 

CGAGACGTTA AGCCTTCAGC TAACAGTCCA TGGGAAGACT GTATGGATAG ATATCCGGTT 1500 

GAGATTGATA TT 1512 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Arg Gin Gly Arg Thr Lys Ser Lys His Leu Ser Lys Thr He Cys 

15 10 15 

Pro Thr Leu Arg Leu Ser Pro Met Lys Asn Leu Lys Met Val Phe Phe 

20 25 30 
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Lys lie Leu Phe He Ser Leu Met Ala Gly Leu Ala Met Lys Gly Ser 

35 40 45 

Lys lie Asn Val Glu Asp Leu Gin Lys Phe Ser Leu His His Thr Gin 

50 55 60 

Asn Asia Leu Gin Thr He Ser Leu Leu Leu Phe Leu Val Val Phe Val 
65 70 75 80 

Trp He Leu Tyr Met Leu Thr Arg Pro Lys Pro Val Tyr Leu Val Asp 

85 90 95 

Phe Ser Cys Tyr Leu Pro Pro Ser His Leu Lys Val Ser He Gin Thr 

100 105 HO 

Leu Met Gly His Ala Arg Arg Ala Arg Glu Ala Gly Met Cys Trp Lys 

115 120 125 

Asn Lys Glu Ser Asp His Leu Val Asp Phe Gin Glu Lys He Leu Glu 

130 135 140 

Arg Ser Gly Leu Gly Gin Glu Thr Tyr He Pro Glu Gly Leu Gin Cys 
145 150 155 160 

Phe Pro Leu Gin Gin Gly Met Gly Ala Ser Arg Lys Glu Thr Glu Glu 

165 170 175 

Val He Phe Gly Ala Leu Asp Asn Leu Phe Arg Asn Thr Gly Val Lys 

180 185 190 

Pro Asp Asp He Gly He Leu Val Val Asn Ser Ser Thr Phe Asn Pro 

195 200 205 

Thr Pro Ser Leu Ala Ser Met He Val Asn Lys Tyr Lys Leu Arg Asp 

210 215 220 

Asn He Lys Ser Leu Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Val 
225 230 235 240 

He Ala Val Asp Val Ala Lys Gly Leu Leu Gin Val His Arg Asn Thr 

245 250 255 

Tyr Ala He Val Val Ser Thr Glu Asn He Thr Gin Asn Leu Tyr Leu 

260 265 270 

Gly Lys Asn Lys Ser Met Leu Val Thr Asn Cys Leu Phe Arg Val Gly 

275 280 285 

Gly Ala Ala Val Leu Leu Ser Asn Arg Ser Arg Asp Arg Asn Arg Ala 

290 295 300 

Lys Tyr Glu Leu Val His Thr Val Arg He His Thr Gly Ser Asp Asp 
305 310 315 320 

Arg Ser Phe Glu Cys Ala Thr Gin Glu Glu Asp Glu Asp Gly He He 

325 330 335 

Gly Val Thr Leu Thr Lys Asn Leu Pro Met Val Ala Ala Arg Thr Leu 

340 345 350 

Lys He Asn He Ala Thr Leu Gly Pro Leu Val Leu Pro Leu Lys Glu 

355 360 365 

Lys Leu Ala Phe Phe He Thr Phe Val Lys Lys Lys Tyr Phe Lys Pro 

370 375 380 

Glu Leu Arg Asn Tyr Thr Pro Asp Phe Lys Leu Ala Phe Glu His Phe 
385 390 395 400 

Cys He His Ala Gly Gly Arg Ala Leu He ABp Glu Leu Glu Lys Asn 

405 410 415 

Leu Lys Leu Ser Pro Leu His Val Glu Ala Ser Arg Met Thr Leu His 

420 425 430 

Arg Phe Gly Asn Thr Ser Ser Ser Ser He Trp Tyr Glu Leu Ala Tyr 

435 440 445 

Thr Glu Ala Lys Gly Arg Met Lys Glu Gly Asp Arg He Trp Gin He 

450 455 460 

Ala Leu Gly Ser Gly Phe Lys Cys Asn Ser Ser Val Trp Val Ala Leu 
465 470 475 480 

Arg Asp Val Lys Pro Ser Ala Asn Ser Pro Trp Glu Asp Cys Met Asp 

485 490 495 

Arg Tyr Pro Val Glu He Asp He 
500 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATGGGTAGAT CCAACGAGCA AGATCTGCTC TCTACCGAGA TCGTTAATCG TGGGATCGAA 60 

CCATCCGGTC CTAACGCCGG CTCACCAACG TTCTCGGTTA GGGTCAGGAG ACGTTTGCCT 120 

GATTTTCTTC AGTCGGTGAA CTTGAAGTAC GTGAAACTTG GTTACCACTA CCTCATAAAC 180 

CATGCGGTTT ATTTGGCGAC CATACCGGTT CTTGTGCTGG TTTTTAGTGC TGAGGTTGGG 240 

AGTTTAAGCA GAGAAGAGAT TTGGAAGAAG CTT*TGGGACT ATGATCTTGC AACTGTTATC 300 

GGATTCTTCG GTGTCTTTGT TTTAACCGCT TGTGTCTACT TCATGTCTCG TCCTCGCTCT 36 0 

GTTTATCTTA TTGATTTCGC TTGTTACAAG CCCTCCGATG AACACAAGGT GACAAAAGAA 420 

GAGTTCATAG AACTAGCGAG AAAATCAGGG AAGTTCGACG AAGAGACACT CGGTTTCAAG 4 80 

AAGAGGATCT TACAAGCCTC AGGCATAGGC GACGAGACAT ACGTCCCAAG ATCCATCTCT 540 

TCATCAGAAA ACATAACAAC GATGAAAGAA GGTCGTGAAG AAGCCTCTAC AGTGATCTTT 600 

GGAGCACTAG ACGAACTCTT CGAGAAGACA CGTGTAAAAC CTAAAGACGT TGGTGTCCTT 660 

GTGGTTAACT GTAGCATTTT CAACCCGACA CCGTCGTTGT CCGCAATGGT GATAAACCAT 720 

TACAAGATGA GAGGGAACAT ACTTAGTTAC AACCTTGGAG GGATGGGATG TTCGGCTGGA 780 

ATCATAGCTA TTGATCTTGC TCGTGACATG CTTCAGTCTA ACCCTAATAG TTATGCTGTT 840 

GTTGTGAGTA CTGAGATGGT TGGGTATAAT TGGTACGTGG GAAGTGACAA GTCAATGGTT 900 

ATACCTAATT GTTTCTTTAG GATGGGTTGT TCTGCCGTTA TGCTCTCTAA CCGTCGTCGT 960 

GACTTTCGCC ATGCTAAGTA CCGTCTCGAG CACATTGTCC GAACTCATAA GGCTGCTGAC 1020 

GACCGTAGCT TCAGGAGTGT GTACCAGGAA GAAGATGAAC AAGGATTCAA GGGGTTGAAG 1080 

ATAAGTAGAG ACTTAATGGA AGTTGGAGGT GAAGCTCTCA AGACAAACAT CACTACCTTA 114 0 

GGTCCTCTTG TCCTACCTTT CTCCGAGCAG CTTCTCTTCT TTGCTGCTTT GGTCCGCCGA 1200 

ACATTCTCAC CTGCTGCCAA AACGTCCACA ACCACTTCCT TCTCTACTTC CGCCACCGCA 1260 

AAAACCAATG GAATCAAGTC TTCCTCTTCC GATCTGTCCA AGCCATACAT CCCGGACTAC 1320 

AAGCTCGCCT TCGAGCATTT TTGCTTCCAC GCGGCAAGCA AAGTAGTGCT TGAAGAGCTT 1380 

CAAAAGAATC TAGGCTTGAG TGAAGAGAAT ATGGAGGCTT CTAGGATGAC ACTTCACAGG 1440 

TTTGGAAACA CTTCTAGCAG TGGAATCTGG TATGAGTTGG CTTACATGGA GGCCAAGGAA 1500 

AGTGTTCGTA GAGGCGATAG GGTTTGGCAG ATCGCTTTCG GTTCTGGTTT TAAGTGTAAC 1560 

AGTGTGGTGT GGAAGGCAAT GAGGAAGGTG AAGAAGCCAA CCAGGAACAA TCCTTGGGTG 1620 

GATTGCATCA ACCGTTACCC TGTGCCTCTC 1650 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISE 

(A) LENGTH: 550 amino ac 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 





(xi) SEQUENCE 


DESCRIPTION: 


Met 


Gly Arg 


Ser 


Asn 


Glu 


Gin 


Asp 


1 






5 








Arg 


Gly lie 


Glu 


Pro 


Ser 


Gly 


Pro 






20 










Val 


Arg Val 


Arg 


Arg 


Arg 


Leu 


Pro 




35 










40 


Lys 


Tyr Val 


Lys 


Leu 


Gly 


Tyr 


His 


50 








55 




Leu 


Ala Thr 


lie 


Pro 


Val 


Leu 


Val 


65 








70 






Ser 


Leu Ser 


Arg 


Glu 


Glu 


lie 


Trp 








85 








Ala 


Thr Val 


lie 


Gly 


Phe 


Phe 


Gly 






100 











SEQ ID 


NO:8: 




Leu 


Leu 


Ser 


Thr Glu He Val 


Asn 




10 




15 




Asn 


Ala 


Gly 


Ser Pro Thr Phe 


Ser 


25 




30 




Asp 


Phe 


Leu 


Gin Ser Val Asn 


Leu 






45 




Tyr 


Leu 


He 


Asn His Ala Val 


Tyr 






60 




Leu 


Val 


Phe 


Ser Ala Glu Val Gly 






75 




80 


Lys 


Lys 


Leu 


Trp Asp Tyr Asp 


Leu 




90 




95 




Val 


Phe 


Val 


Leu Thr Ala Cys 


Val 


105 






110 
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Tyr Phe Met Ser Arg Pro Arg Ser Val Tyr Leu lie Asp Phe Ala Cys 

115 120 125 

Tyr Lys Pro Ser Asp Glu His Lys Val Thr Lys Glu Glu Phe lie Glu 

130 135 140 

Leu Ala Arg Lys Ser Gly Lys Phe Asp Glu Glu Thr Leu Gly Phe Lys 
145 150 155 160 

Lys Arg lie Leu Gin Ala Ser Gly lie Gly Asp Glu Thr Tyr Val Pro 

165 170 175 

Arg Ser lie Ser Ser Ser Glu Asn lie Thr Thr Met Lys Glu Gly Arg 

180 185 190 

Glu Glu Ala Ser Thr Val He Phe Gly Ala Leu Asp Glu Leu Phe Glu 

195 200 205 

Lys Thr Arg Val Lys Pro Lys Asp Val Gly Val Leu Val Val Asn Cys 

210 215 220 

Ser He Phe Asn Pro Thr Pro Ser Leu Ser Ala Met Val He Asn His 
225 230 235 240 

Tyr Lys Met Arg Gly Asn He Leu Ser Tyr Asn Leu Gly Gly Met Gly 

245 250 255 

Cys Ser Ala Gly He He Ala He Asp Leu Ala Arg Asp Met Leu Gin 

260 265 270 

Ser Asn Pro Asn Ser Tyr Ala Val Val Val Ser Thr Glu Met Val Gly 

275 280 285 

Tyr Asn Trp Tyr Val Gly Ser Asp Lys Ser Met Val He Pro Asn Cys 

290 295 300 

Phe Phe Arg Met Gly Cys Ser Ala Val Met Leu Ser Asn Arg Arg Arg 
305 310 315 320 

Asp Phe Arg His Ala Lys Tyr Arg Leu Glu His He Val Arg Thr His 

325 330 335 

Lys Ala Ala Asp Asp Arg Ser Phe Arg Ser Val Tyr Gin Glu Glu Asp 

340 345 350 

Glu Gin Gly Phe Lys Gly Leu Lys He Ser Arg Asp Leu Met Glu Val 

355 360 365 

Gly Gly Glu Ala Leu Lys Thr Asn He Thr Thr Leu Gly Pro Leu Val 

370 375 380 

Leu Pro Phe Ser Glu Gin Leu Leu Phe Phe Ala Ala Leu Val Arg Arg 
385 390 395 400 

Thr Phe Ser Pro Ala Ala Lys Thr Ser Thr Thr Thr Ser Phe Ser Thr 

405 410 415 

Ser Ala Thr Ala Lys Thr Asn Gly He Lys Ser Ser Ser Ser Asp Leu 

420 425 430 

Ser Lys Pro Tyr He Pro Asp Tyr Lys Leu Ala Phe Glu His Phe Cys 

435 440 445 

Phe His Ala Ala Ser Lys Val Val Leu Glu Glu Leu Gin Lys Asn Leu 

450 455 460 

Gly Leu Ser Glu Glu Asn Met Glu Ala Ser Arg Met Thr Leu His Arg 
465 470 475 480 

Phe Gly Asn Thr Ser Ser Ser Gly lie Trp Tyr Glu Leu Ala Tyr Met 

485 490 495 

Glu Ala Lys Glu Ser Val Arg Arg Gly Asp Arg Val Trp Gin He Ala 

500 505 510 

Phe Gly Ser Gly Phe Lys Cys Asn Ser Val Val Trp Lys Ala Met Arg 

515 520 525 

Lys Val Lys Lys Pro Thr Arg Asn Asn Pro Trp Val Asp Cys He Asn 

530 535 540 

Arg Tyr Pro Val Pro Leu 
545 550 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



TCGAGCTACG TCAGGGCTTT TATATGCACA AATTCTCATA AAGTTTTCAA TTTTATTCCA 60 

TTTTTCTCGG AAGCCATGGA AGCTGCTAAT GAGCCTGTTA ATGGCGGATC CGTACAGATC 120 

CGAACAGAGA ACAACGAAAG ACGAAAGCTT CCTAATTTCT TACAAAGCGT CAACATGAAA 180 

TACGTCAAGC TAGGTTATCA TTACCTCATT ACTCATCTCT TCAAGCTCTG TTTGGTTCCA 240 

TTAATGGCGG TTTTAGTCAC AGAGATCTCT CGATTAACAA CAGACGATCT TTACCAGATT 3 00 

TGGCTTCATC TCCAATACAA TCTCGTTGCT TTCATCTTTC TCTCTGCTTT AGCTATCTTT 3 60 

GGCTCCACCG TTTACATCAT GAGTCGTCCC AGATCTGTTT ATCTCGTTGA TTACTCTTGT 420 

TATCTTCCTC CGGAGAGTCT TCAGGTTAAG TATCAGAAGT TTATGGATCA TTCTAAGTTG 480 

ATTGAAGATT TCAATGAGTC ATCTTTAGAG TTTCAGAGGA AGATTCTTGA ACGTTCTGGT 540 

TTAGGAGAAG AGACTTATCT CCCTGAAGCT TTACATTGTA TCCCTCCGAG GCCTACGATG 600 

ATGGCGGCTC GTGAGGAATC TGAGCAGGTA ATGTTTGGTG CTCTTGATAA GCTTTTCGAG 660 

AATACCAAGA TTAACCCTAG GGATATTGGT GTGTTGGTTG TGAATTGTAG CTTGTTTAAT 720 

CCTACACCTT CGTTGTCAGC TATGATTGTT AACAAGTATA AGCTTAGAGG GAATGTTAAG 780 

AGTTTTAACC TTGGTGGAAT GGGGTGTAGT GCTGGTGTTA TCTCTATCGA TTTAGCTAAA 840 

GATATGTTGC AAGTTCATAG GAATACTTAT GCTGTTGTGG TTAGTACTGA GAAfcATTACT 900 

CAGAATTGGT ATTTTGGGAA TAAGAAGGCT ATGTTGATTC CGAATTGTTT GTTTCGTGTT 960 

GGTGGTTCGG CGATTTTGTT GTCGAACAAG GGGAAAGATC GTAGACGGTC TAAGTATAAG 1020 

CTTGTTCATA CCGTTAGGAC TCATAAAGGA GCTGTTGAGA AGGCTTTCAA CTGTGTTTAC 10 80 

CAAGAGCAAG ATGATAATGG GAAGACCGGG GTTTCGTTGT CGAAAGATCT TATGGCTATA 1140 

GCTGGGGAAG CTCTTAAGGC GAATATCACT ACTTTAGGTC CTTTGGTTCT TCCTATAAGT 1200 

GAGCAGATTC TGTTTTTCAT GACTTTGGTT ACGAAGAAAC TGTTTAACTC GAAGCTGAAG 1260 

CCGTATATTC CGGATTTCAA GCTTGCGTTT GATCATTTCT GTATCCATGC TGGTGGTAGA 1320 

GCTGTGATTG ATGAGCTTGA GAAGAATCTG CAGCTTTCGC AGACTCATGT CGAGGCATCC 13 80 

AGAATGACAC TGCACAGATT TGGAAACACT TCTTCGAGCT CGATTTGGTA TGAACTGGCT 1440 

TACATAGAGG CTAAAGGTAG GATGAAGAAA GGAAACCGGG TTTGGCAGAT TGCTTTTGGA 1500 

AGTGGGTTTA AGTGTAACAG TGCAGTTTGG GTGGCTCTAA ACAATGTCAA GCCTTCGGTT 1560 

AGTAGTCCGT GGGAACACTG CATCGACCGA TATCCGGTTA AGCTCGACTT C 1611 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Ser Tyr Val Arg Ala Phe lie Cys Thr Asn Ser His Lys Val Phe 

15 10 15 

Asn Phe He Pro Phe Phe Ser Glu Ala Met Glu Ala Ala Asn Glu Pro 

20 25 30 

Val Asn Gly Gly Ser Val Gin He Arg Thr Glu Asn Asn Glu Arg Arg 

35 40 45 

Lys Leu Pro Asn Phe Leu Gin Ser Val Asn Met Lys Tyr Val Lys Leu 

50 55 60 

Gly Tyr His Tyr Leu He Thr His Leu Phe Lys Leu Cys Leu Val Pro 
65 70 75 80 

Leu Met Ala Val Leu Val Thr Glu He Ser Arg Leu Thr Thr Asp Asp 

85 90 95 

Leu Tyr Gin He Trp Leu His Leu Gin Tyr Asn Leu Val Ala Phe He 

100 105 HO 

Phe Leu Ser Ala Leu Ala He Phe Gly Ser Thr Val Tyr He Met Ser 

115 120 125 

Arg Pro Arg Ser Val Tyr Leu Val Asp Tyr Ser Cys Tyr Leu Pro Pro 

130 135 140 

Glu Ser Leu Gin Val Lys Tyr Gin Lys Phe Met Asp His Ser Lys Leu 
145 150 155 160 

He Glu Asp Phe Asn Glu Ser Ser Leu Glu Phe Gin Arg Lys He Leu 

165 170 175 

Glu Arg Ser Gly Leu Gly Glu Glu Thr Tyr Leu Pro Glu Ala Leu His 
180 185 190 
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Cys lie Pro 


Pro 


Arg 


Pro 


Thr 


Met 


Met 


Ala 


Ala 


Arg Glu 


Glu 


Ser 


Glu 


195 










200 








205 








Gin Val Met 


Phe 


Gly 


Ala 


Leu 


Asp 


Lys 


Leu 


Phe 


Glu Asn 


Thr 


Lys 


He 


210 








215 










220 






Asn Pro Arg 


Asp 


He 


Gly 


Val 


Leu 


Val 


Val 


Asn 


Cys Ser 


Leu 


Phe 


Asn 


225 






230 










235 






240 


Pro Thr Pro 


Ser 


Leu 


Ser 


Ala 


Met 


He 


Val 


Asn 


Lys Tyr 


Lys 


Leu 


Arg 






245 










250 






255 


Gly Asn Val 


Lys 


Ser 


Phe 


Asn 


Leu 


Gly 


Gly 


Met 


Gly Cys 


Ser 


Ala 


Gly 




260 










265 








270 




Val He Ser 


He 


Asp 


Leu 


Ala 


Lys 


Asp 


Met 


Leu 


Gin Val 


His 


Arg 


Asn 


275 










280 








285 






Thr Tyr Ala 


Val 


Val 


Val 


Ser 


Thr 


Glu 


Asn 


He 


Thr Gin 


Asn 


Trp 


Tvr 


290 








295 










300 




Phe Gly Asn 


Lys 


Lys 


Ala 


Met 


Leu 


He 


Pro 


Asn 


Cys Leu 


Phe 


Ara 


Val 


305 






310 










315 






320 


Gly Gly Ser 


Ala 


He 


Leu 


Leu 


Ser 


Asn 


Lys 


Gly 


Lys Asp 


Arg 


Ara 


Arg 






325 










330 








335 


Ser Lys Tyr 


Lys 


Leu 


Val 


His 


Thr 


Val 


Arg 


Thr 


His Lys 


Gly 


Ala 


Val 




340 










345 








350 






Glu Lys Ala 


Phe 


Asn 


Cys 


Val 


Tyr 


Gin 


Glu 


Gin 


Asp Asp 


Asn 


Gly 


Lys 


355 










360 








365 




Thr Gly Val 


Ser 


Leu 


Ser 


Lys 


Asp 


Leu 


Met 


Ala 


He Ala 


Gly 


Glu 


Ala 


370 








375 










380 






Leu Lys Ala 


Asn 


He 


Thr 


Thr 


Leu 


Gly 


Pro 


Leu 


Val Leu 


Pro 


He 


Ser 


385 






390 










395 








400 


Glu Gin He 


Leu 


Phe 


Phe 


Met 


Thr 


Leu 


Val 


Thr 


Lys Lys 


Leu 


Phe 


Asn 






405 










410 






415 




Ser Lys Leu 


Lys 


Pro 


Tyr 


He 


Pro 


Asp 


Phe 


Lys 


Leu Ala 


Phe 


Asp 


His 




420 










425 








430 






Phe Cys He 


His 


Ala 


Gly 


Gly 


Arg 


Ala 


Val 


He 


Asp Glu 


Leu 


Glu 


Lys 


435 










440 








445 








Asn Leu Gin 


Leu 


Ser 


Gin 


Thr 


His 


Val 


Glu 


Ala 


Ser Arg 


Met 


Thr 


Leu 


450 








455 










460 








His Arg Phe 


Gly 


Asn 


Thr 


Ser 


Ser 


Ser 


Ser 


He 


Trp Tyr 


Glu 


Leu 


Ala 


465 






470 










475 






480 


Tyr He Glu 


Ala 


Lys 


Gly 


Arg 


Met 


Lys 


Lys 


Gly 


Asn Arg 


Val 


Trp 


Gin 






485 










490 








495 




He Ala Phe 


Gly 


Ser 


Gly 


Phe 


Lys 


Cys 


Asn 


Ser 


Ala Val 


Trp 


Val 


Ala 




500 










505 








510 






Leu Asn Asn 


Val 


Lys 


Pro 


Ser 


Val 


Ser 


Ser 


Pro 


Trp Glu 


His 


Cys 


He 


515 










520 








525 








Asp Arg Tyr 


Pro 


Val 


Lys 


Leu 


Asp 


Phe 














530 








535 



















(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCTCCGACGA TGCCTCAGGC ACCGATGCCA GAGTTCTCTA GCTCGGTGAA GCTCAAGTAC 60 

GTGAAACTTG GTTACCAATA TTTGGTTAAC CATTTCTTGA GT1TTCTTTT GATCCCGATC 120 

ATGGCTATTG TCGCCGTTGA GCTTCTTCGG ATGGGTCCTG AAGAGATCCT TAATGTTTGG 180 

AATTCACTCC AGTTTGACCT AGTTCAGGTT CTATGTTCTT CCTTCTTTGT CATCTTCATC 240 

TCCACTGTTT ACTTCATGTC CAAGCCACGC ACCATCTACC TCGTTGACTA TTCTTGTTAC 300 

AAGCCACCTG TCACGTGTCG TGTCCCCTTC GCAACTTTCA TGGAACACTC TCGTTTGATC 360 

CTCAAGGACA AGCCTAAGAG CGTCGAGTTC CAAATGAGAA TCCTTGAACG TTCTGGCCTC 420 

GGTGAGGAGA CTTGTCTCCC TCCGGCTATT CATTATATTC CTCCCACACC AACCATGGAC 480 

GCGGCTAGAA GCGAGGCTCA GATGGTTATC TTCGAGGCCA TGGACGATCT TTTCAAGAAA 540 

ACCGGTCTTA AACCTAAAGA CGTCGACATC CTTATCGTCA ACTGCTCTCT TTTCTCTCCC 600 
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ACACCATCGC TCTCAGCTAT GGTCATCAAC AAATATAAGC TTAGGAGTAA TATCAAGAGC 660 

TTCAATCTTT CGGGGATGGG CTGCAGCGCG GGCCTGATCT CAGTTGATCT AGCCCGCGAC 720 

TTGCTCCAAG TTCATCCCAA TTCAAATGCA ATCATCGTCA GCACGGAGAT CATAACGCCT 7 80 

AATTACTATC AAGGCAACGA GAGAGCCATG TTGTTACCCA ATTGTCTCTT CCGCATGGGT 84 0 

GCGGCAGCCA TACACATGTC AAACCGCCGG TCTGACCGGT GGCGAGCCAA ATACAAGCTT 900 

TCCCACCTCG TCCGGACACA CCGTGGCGCT GACGACAAGT CTTTCTACTG TGTCTACGAA 960 

CAGGAAGACA AAGAAGGACA CGTTGGCATC AACTTGTCCA AAGATCTCAT GGCCATCGCC 1020 

GGTGAAGCCC TCAAGGCAAA CATCACCACA ATAGGTCCTT TGGTCCTACC GGCGTCAGAA 1080 

CAACTTCTCT TCCTCACGTC CCTAATCGGA CGTAAAATCT TCAACCCGAA ATGGAAACCA 1140 

TACATACCGG ATTTCAAGCT GGCCTTCGAA CACTTTTGCA TTCACGCAGG AGGCAGAGCG 1200 

GTGATCGACG AGCTCCAAAA GAATCTACAA CTATCAGGAG AACACGTTGA GGCCTCAAGA 1260 

ATGACACTAC ATCGTTTTGG TAACACGTCA TCTTCATCGT TATGGTACGA GCTTAGCTAC 1320 

ATCGAGTCTA AAGGGAGAAT GAGGAGAGGC GATCGCGTTT GGCAAATCGC GTTTGGGAGT 1380 

GGTTTCAAGT GTAACTCTGC CGTGTGGAAG TGTAACCGTA CGATTAAGAC ACCTAAGGAC 1440 

GGACCATGGT CCGATTGTAT CGACCGTTAC CCTGTCTTTA TTCCCGAAGT TGTCAAACTC 15 00 

TA a 1502 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 500 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Ser Pro Thr Met Pro Gin Ala Pro Met Pro Glu Phe Ser Ser Ser Val 

15 10 15 

Lys Leu Lys Tyr Val Lys Leu Gly Tyr Gin Tyr Leu Val Asn His Phe 

20 25 30 

Leu Ser Phe Leu Leu lie Pro lie Met Ala lie Val Ala Val Glu Leu 

35 40 45 

Leu Arg Met Gly Pro Glu Glu lie Leu Asn Val Trp Asn Ser Leu Gin 

50 55 60 

Phe Asp Leu Val Gin Val Leu Cys Ser Ser Phe Phe Val He Phe He 
65 70 75 80 

Ser Thr Val Tyr Phe Met Ser Lys Pro Arg Thr He Tyr Leu Val Asp 

85 90 95 

Tyr Ser Cys Tyr Lys Pro Pro Val Thr Cys Arg Val Pro Phe Ala Thr 

100 105 HO 

Phe Met Glu His Ser Arg Leu He Leu Lys Asp Lys Pro Lys Ser Val 

115 120 125 

Glu Phe Gin Met Arg He Leu Glu Arg Ser Gly Leu Gly Glu Glu Thr 

130 135 140 

Cys Leu Pro Pro Ala He His Tyr He Pro Pro Thr Pro Thr Met Asp 
145 150 155 160 

Ala Ala Arg Ser Glu Ala Gin Met Val He Phe Glu Ala Met Asp Asp 

165 170 175 

Leu Phe Lys Lys Thr Gly Leu Lys Pro Lys Asp Val Asp He Leu He 

180 185 190 

Val Asn Cys Ser Leu Phe Ser Pro Thr Pro Ser Leu Ser Ala Met Val 

195 200 205 

He Asn Lys Tyr Lys Leu Arg Ser Asn He Lys Ser Phe Asn Leu Ser 

210 215 220 

Gly Met Gly Cys Ser Ala Gly Leu lie Ser Val Asp Leu Ala Arg Asp 
225 230 235 240 

Leu Leu Gin Val His Pro Asn Ser Asn Ala He He Val Ser Thr Glu 

245 250 255 

He He Thr Pro Asn Tyr Tyr Gin Gly Asn Glu Arg Ala Met Leu Leu 

260 265 270 

Pro Asn Cys Leu Phe Arg Met Gly Ala Ala Ala He His Met Ser Asn 

275 280 285 

Arg Arg Ser Asp Arg Trp Arg Ala Lys Tyr Lys Leu Ser His Leu Val 
290 295 300 
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Arg 


Thr 


His 


Arg 


Gly 


Ala 


Asp 


Asp 


Lys 


Ser 


Phe Tyr Cys 


Val 


Tyr 


Glu 


305 










310 










315 






320 


Gin 


Glu 


Asp 


Lys 


Glu 


Gly 


His 


Val 


Gly 


He 


Asn Leu Ser 


Lys 


Asp 


Leu 










325 










330 




335 




Met 


Ala 


He 


Ala 


Gly 


Glu 


Ala 


Leu 


Lys 


Ala 


Asn He Thr 


Thr 


He 


Gly 








340 










345 






350 




Pro 


Leu 


Val 


Leu 


Pro 


Ala 


Ser 


Glu 


Gin 


Leu 


Leu Phe Leu 


Thr 


Ser 


Leu 






355 










360 






365 








lie 


Gly 


Arg 


Lys 


He 


Phe 


Asn 


Pro 


Lys 


Trp 


Lys Pro Tyr 


He 


Pro 


Asp 




370 










375 








380 






Phe 


Lys 


Leu 


Ala 


Phe 


Glu 


His 


Phe 


Cys 


He 


His Ala Gly 


Gly Arg 


Ala 


385 










390 










395 






400 


Val 


He 


Asp 


Glu 


Leu 


Gin 


Lys 


Asn 


Leu 


Gin 


Leu Ser Gly 


Glu 


His 


Val 










405 










410 




415 




Glu 


Ala 


Ser 


Arg 


Met 


Thr 


Leu 


His 


Arg 


Phe 


Gly Asn Thr 


Ser 


Ser 


Ser 








420 










425 






430 






Ser' 


Leu 


Trp 


Tyr 


Glu 


Leu 


Ser 


Tyr 


He 


Glu 


Ser Lys *Gly 


Arg Met 


Arg 






435 










440 






445 






Arg 


Gly 


Asp 


Arg 


Val 


Trp 


Gin 


He 


Ala 


Phe 


Gly Ser Gly 


Phe 


Lys 


Cys 




450 










455 








460 






Asn 


Ser 


Ala 


Val 


Trp 


Lys 


Cys 


Asn 


Arg 


Thr 


He Lys Thr 


Pro 


Lys 


Asp 


465 










470 










475 






480 


Gly 


Pro 


Trp 


Ser 


Asp 


Cys 


He 


Asp 


Arg 


Tyr 


Pro Val Phe 


He 


Pro 


Glu 










485 










490 






495 




Val 


Val 


Lys 


Leu 


























500 























(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

ATGGACGGTG CCGGAGAATC ACGACTCGGT GGTGATGGTG GTGGTGATGG TTCTGTTGGA 60 

GTTCAGATCC GACAAACACG GATGCTACCG GATTTTCTCC AGAGCGTGAA TCTCAAGTAT 120 

GTGAAATTAG GTTACCATTA CTTAATCTCA AATCTCTTGA CTCTCTGTTT ATTCCCTCTC 180 

GCCGTTGTTA TCTCCGTCGA AGCCTCTCAG ATGAACCCAG ATGATCTCAA ACAGCTCTGG 240 

ATCCATCTAC AATACAATCT GGTTAGTATC ATCATCTGTT CAGCGATTCT AGTCTTCGGG 300 

TTAACGGTTT ATGTTATGAC CCGACCTAGA CCCGTTTACT TGGTTGATTT CTCTTGTTAT 360 

CTCCCACCTG ATCATCTCAA AGCTCCTTAC GCTCGGTTCA TGGAACATTC TAGACTCACC 420 

GGAGATTTCG ATGACTCTGC TCTCGAGTTT CAACGCAAGA TCCTTGAGCG TTCTGGTTTA 480 

GGGGAAGACA CTTATGTCCC TGAAGCTATG CATTATGTTC CACCGAGAAT TTCAATGGCT 540 

GCTGCTAGAG AAGAAGCTGA ACAAGTCATG TTTGGTGCTT TAGATAACCT TTTCGCTAAC 600 

ACTAATGTGA AACCAAAGGA TATTGGAATC CTTGTTGTGA ATTGTAGTCT CTTTAATCCA 660 

ACTCCTTCGT TATCTGCAAT GATTGTGAAC AAGTATAAGC TTAGAGGTAA CATTAGAAGC 720 

TACAATCTAG GCGGTATGGG TTGCAGCGCG GGAGTTATCG CTGTGGATCT TGCTAAAGAC 780 

ATGTTGTTGG TACATAGGAA CACTTATGCG GTTGTTGTTT CTACTGAGAA CATTACTCAG 840 

AATTGGTATT TTGGTAACAA GAAATCGATG TTGATACCGA ACTGCTTGTT TCGAGTTGGT 900 

GGCTCTGCGG TTTTGCTATC GAACAAGTCG AGGGACAAGA GACGGTCTAA GTACAGGCTT 960 

GTACATGTAG TCAGGACTCA CCGTGGAGCA GATGATAAAG CTTTCCGTTG TGTTTATCAA 1020 

GAGCAGGATG ATACAGGGAG AACCGGGGTT TCGTTGTCGA AAGATCTAAT GGCGATTGCA 1080 

GGGGAAACTC TCAAAACCAA TATCACTACA TTGGGTCCTC TTGTTCTACC GATAAGTGAG 1140 

CAGATTCTCT TCTTTATGAC TCTAGTTGTG AAGAAGCTCT TTAACGGTAA AGTGAAACCG 1200 

TATATCCCGG ATTTCAAACT TGCTTTCGAG CATTTCTGTA TCCATGCTGG TGGAAGAGCT 1260 

GTGATCGATG AGTTAGAGAA GAATCTGCAG CTTTCACCAG TTCATGTCGA GGCTTCGAGG 1320 

ATGACTCTTC ATCGATTTGG TAACACATCT TCGAGCTCCA TTTGGTATGA ATTGGCTTAC 13 80 

ATTGAAGCGA AGGGAAGGAT GCGAAGAGGT AATCGTGTTT GGCAAATCGC GTTCGGAAGT 1440 

GGATTTAAAT GTAATAGCGC GATTTGGGAA GCATTAAGGC ATGTGAAACC TTCGAACAAC 1500 

AGTCCTTGGG AAGATTGTAT TGACAAGTAT CCGGTAACTT TAAGTTAT 154 8 



(2) INFORMATION FOR SEQ ID NO: 14: 



WO 98/54954 



PCT/US98/U384 



- 49 - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Gly Ala Gly Glu Ser Arg Leu Gly Gly Asp Gly Gly Gly Asp 

15 10 15 

Gly Ser Val Gly Val Gin He Arg Gin Thr Arg Met Leu Pro Asp Phe 

20 25 30 

Leu Gin Ser Val Asn Leu Lys Tyr Val Lys Leu Gly Tyr His Tyr Leu 

35 40 45 

He Ser Asn Leu Leu Thr Leu Cys Leu Phe Pro Leu Ala Val Val lie 

50 55 60 

Ser Val Glu Ala Ser Gin Met Asn Pro Asp Asp Leu Lys Gin Leu Trp 
65 70 75 80 

He His Leu Gin Tyr Asn Leu Val Ser He He He Cys Ser Ala lie 

85 90 95 

Leu Val Phe Gly Leu Thr Val Tyr Val Met Thr Arg Pro Arg Pro Val 

100 105 110 

Tyr Leu Val Asp Phe Ser Cys Tyr Leu Pro Pro Asp His Leu Lys Ala 

115 120 125 

Pro Tyr Ala Arg Phe Met Glu His Ser Arg Leu Thr Gly Asp Phe Asp 

130 135 140 

Asp Ser Ala Leu Glu Phe Gin Arg Lys He Leu Glu Arg Ser Gly Leu 
145 150 155 160 

Gly Glu Asp Thr Tyr Val Pro Glu Ala Met His Tyr Val Pro Pro Arg 

165 170 175 

He Ser Met Ala Ala Ala Arg Glu Glu Ala Glu Gin Val Met Phe Gly 

180 185 190 

Ala Leu Asp Asn Leu Phe Ala Asn Thr Asn Val Lys Pro Lys Asp He 

195 200 205 

Gly He Leu Val Val Asn Cys Ser Leu Phe Asn Pro Thr Pro Ser Leu 

210 215 220 

Ser Ala Met He Val Asn Lys Tyr Lys Leu Arg Gly Asn He Arg Ser 
225 230 235 240 

Tyr Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Val He Ala Val Asp 

245 250 255 

Leu Ala Lys Asp Met Leu Leu Val His Arg Asn Thr Tyr Ala Val Val 

260 265 270 

Val Ser Thr Glu Asn He Thr Gin Asn Trp Tyr Phe Gly Asn Lys Lys 

275 280 285 

Ser Met Leu He 'Pro Asn Cys Leu Phe Arg Val Gly Gly Ser Ala Val 

290 295 300 

Leu Leu Ser Asn Lys Ser Arg Asp Lys Arg Arg Ser Lys Tyr Arg Leu 
305 310 315 320 

Val His Val Val Arg Thr His Arg Gly Ala Asp Asp Lys Ala Phe Arg 

325 330 335 

Cys Val Tyr Gin Glu Gin Asp Asp Thr Gly Arg Thr Gly Val Ser Leu 

340 345 350 

Ser Lys Asp Leu Met Ala He Ala Gly Glu Thr Leu Lys Thr Asn He 

355 360 365 

Thr Thr Leu Gly Pro Leu Val Leu Pro He Ser Glu Gin He Leu Phe 

370 375 380 

Phe Met Thr Leu Val Val Lys Lys Leu Phe Asn Gly Lys Val Lys Pro 
385 390 395 400 

Tyr He Pro Asp Phe Lys Leu Ala Phe Glu His Phe Cys He His Ala 

405 410 415 

Gly Gly Arg Ala Val He Asp Glu Leu Glu Lys Asn Leu Gin Leu Ser 

420 425 430 

Pro Val His Val Glu Ala Ser Arg Met Thr Leu His Arg Phe Gly Asn 
435 440 445 
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Thr Ser Ser Ser Ser lie Trp Tyr Glu Leu Ala Tyr lie Glu Ala Lys 

450 455 460 

Gly Arg Met Arg Arg Gly Asn Arg Val Trp Gin lie Ala Phe Gly Ser 
465 470 475 480 

Gly Phe Lys Cys Asn Ser Ala lie Trp Glu Ala Leu Arg His Val Lys 

485 490 495 

Pro Ser Asn Asn Ser Pro Trp Glu Asp Cys He Asp Lys Tyr Pro Val 

500 505 510 

Thr Leu Ser Tyr 
515 
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WHAT IS CLAIMED IS: 

1. An isolated polynucleotide encoding a polypeptide 
having an amino acid sequence selected from the group 
consisting of: an amino acid sequence substantially 
identical to SEQ ID NO: 2, an amino acid sequence 
substantially identical to SEQ ID N0:4, an amino acid 
sequence substantially identical to SEQ ID NO : 6 , an amino 
acid sequence substantially identical to SEQ ID NO: 8, an 
amino acid sequence substantially identical to SEQ ID 

NO: 10, an amino acid sequence substantially identical to 
SEQ ID NO: 12, and an amino acid sequence substantially 
identical to SEQ ID NO: 14. 

2. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO:2. 

3. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO:4. 

4. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 6. 

5. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO:8. 

6. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 10. 

7. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 12. 

8. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 14. 
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9. An isolated polynucleotide, wherein said 

polynucleotide is selected from the group consisting of 

a) SEQ ID NO:lj 

b) SEQ ID NO: 3, 

c) SEQ ID N0:5, 

d) SEQ ID NO: 7, 

e) SEQ ID NO: 9, 

f) SEQ ID NO:ll; 

g) SEQ ID NO: 13; 

h) an RNA analog of SEQ ID N0:1, 

i) an RNA analog of SEQ ID NO: 3, 
j) an RNA analog of SEQ ID NO: 5; 
k) an RNA analog of SEQ ID NO: 7; 
1) an RNA analog of SEQ ID NO: 9; 
m) an RNA analog of SEQ ID NO: 11; 
n) an RNA analog of SEQ ID NO: 13; 

o) a polynucleotide having a nucleic acid sequence 
complementary to a) , b) , c) , d) , e) , f ) , g) , h) , i) , j), 
k) , 1) , m) , or n) ; and 

p) a nucleic acid fragment of a) , b) , c) , d) , e) , 
f ) / g) , h) , i) , j) , k) , 1) , m) , n) , or o) that is at 
least 15 nucleotides in length and that hybridizes under 
stringent conditions to genomic DNA encoding the 
polypeptide of SEQ ID NO: 2, SEQ ID N0:4, SEQ ID NO: 6 , SEQ 
ID N0:8, SEQ ID NO:10, SEQ ID NO:12, or SEQ ID N0:14. 



10. An isolated polypeptide having an amino acid 
sequence selected from the group consisting of: an amino 
acid sequence substantially identical to SEQ ID NO: 2, an 
amino acid sequence substantially identical to SEQ ID 
NO: 4, an amino acid sequence substantially identical to 
SEQ ID NO: 6, an amino acid sequence substantially 
identical to SEQ ID NO: 8, an amino acid sequence 
substantially identical to SEQ ID NO: 10, an amino acid 
sequence substantially identical to SEQ ID NO: 12, and an 
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amino acid sequence substantially identical to SEQ ID 
NO: 14 . 



11, The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO:2. 



12. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID N0:4. 

13. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 6. 

14. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 8. 

15. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 10. 

16. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 12. 

17. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 14. 

18. A transgenic plant containing a nucleic acid 
construct comprising a polynucleotide selected from the 
group consisting of: 

a) SEQ ID N0:1; 

b) SEQ ID N0:3; 

c) SEQ ID NO: 5; 

d) SEQ ID NO:7; 

e) SEQ ID NO: 9; 

f) SEQ ID NO: 11; 

g) SEQ ID NO: 13; 

h) an RNA analog of SEQ ID NO:l; 
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i) an RNA analog of SEQ ID NO: 3; 
j) an RNA analog of SEQ ID NO: 5; 
k) an RNA analog of SEQ ID NO: 7; 
1) an RNA analog of SEQ ID NO: 9; 
m) an RNA analog of SEQ ID NO: 11; 
n) an RNA analog of SEQ ID NO: 13; 

o) a polynucleotide having a nucleic acid sequence 
complementary to a) , b) , c) , d) , e) , f ) , g) , h) , i) , j ) , 
k) , 1) , m) , or n) ; and 

p) a nucleic acid fragment of a) , b) , c) , d) , e) , 
f ) , g) , h) , i) , j) , k) , 1) , m) , n) , or o) that is at 
least 15 nucleotides in length and that hybridizes under 
stringent conditions to genomic DNA encoding the 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, or SEQ ID NO: 14. 

19. The plant of claim 18, wherein said construct 
further comprises a regulatory element operably linked to 
said polynucleotide. 

20. The plant of claim 19, wherein said regulatory 
element is a tissue-specific promoter. 

21. The plant of claim 20, wherein said regulatory 

element is an epidermal cell-specific promoter. >< 

22. The plant of claim 20, wherein said regulatory 
element is a seed- specif ic promoter that is operably 
linked in sense orientation to said polynucleotide. 

23. The plant of claim 22, wherein said plant has 
altered levels of very long chain fatty acids in seeds 
compared to the levels in a plant lacking said nucleic 
acid construct. 
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linked. Examples of regulatory sequences are known in 
the art and include, without limitation, minimal 
promoters and promoters of genes preferentially or 
exclusively expressed in seeds or in epidermal cells of 
5 stems and leaves. Native regulatory sequences of the 
polynucleotides disclosed herein can be readily isolated 
by those skilled in the art and used in constructs of the 
invention. Other examples of suitable regulatory 
sequences include enhancers or enhancer-like elements, 

10 introns, 3 ' non- coding regions such as poly A sequences 
and other regulatory sequences discussed herein. 
Molecular biology techniques for preparing such chimeric 
genes are known in the art. 

In other embodiments, a transgenic plant contains 

15 a nucleic acid construct comprising a partial or a full- 
length KAS coding sequence operably linked to at least 
one suitable regulatory sequence in antisense 
orientation. The chimeric gene can be introduced into a 
plant and transgenic progeny displaying expression of the 

2 0 antisense construct are identified. 

One may use a polynucleotide disclosed herein for 
cosuppression as well as for antisense inhibition. 
Cosuppression of genes in plants may be achieved by 
expressing, in the sense orientation, the entire or 

2 5 partial coding sequence of a gene. See, e.g., WO 

04X11516, incorporated herein by reference. 

Transgenic techniques for use in the invention 
include, without limitation, Agrobac Cerium- mediated 
transformation, viral vector-mediated transformation 

3 0 electroporation and particle gun transformation. 

Illustrative examples of transformation techniques are 
described in U.S. Patent 5,204,253, (particle gun) and 
U.S. Patent 5,188,958 (Agrobacterium) , incorporated 
herein by reference. Transformation methods utilizing 
35 the Ti and Ri plasmids of Agrobacterium spp. typically 
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use binary- type vectors. Walkerpeach, C. et al . , in 
Plant Molecular Biology Manual, S. Gelvin and R. 
Schilperoort , eds., Kluwer Dordrecht, Cl:l-19 (1994). If 
cell or tissue cultures are used as the recipient tissue 
5 for transformation, plants can be regenerated from 

transformed cultures by techniques known to those skilled 
in the art. 

Techniques are known for the introduction of DNA 
into monocots as well as dicots, as are the techniques 

10 for culturing such plant tissues and regenerating those 
tissues, Monocots which have been successfully 
transformed and regenerated include wheat, corn, rye, 
rice, and asparagus- See, e.g., U.S. Patent Nos. 
5,484,956 and 5,550,318, incorporated herein by 

15 reference. 

For efficient production of transgenic plants from 
plant cells, it is desirable that the plant tissue used 
for transformation possess a high capacity for 
regeneration. Transgenic plants of woody species such as 

20 poplar and aspen have also been obtained. Technology is 
also available for the manipulation, transformation, and 
regeneration of gymnosperm plants. For example, U.S. 
Patent No. 5,122,466 describes the biolistic 
transformation of conifers, with preferred target tissue 

25 being meristematic and cotyledon and hypocotyl tissues. 
U.S. Patent No. 5,041,382 describes enrichment of conifer 
embryonal cells. 

Seeds produced by a transgenic plant (s) can be 
grown and then selfed (or out crossed and selfed) to 

30 obtain seeds homozygous for the construct. Seeds can be 
analyzed in order to identify those homozygotes having 
the desired expression of the construct. Transgenic 
plants may be entered into a breeding program, e.g., to 
introgress the novel construct into other lines, to 

35 transfer the construct to other species, or for further 
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selection of other desirable traits. Alternatively, 
transgenic plants may be propagated vegetatively for 
those species amenable to such techniques. A nucleic 
acid construct of the invention can alter the levels of 
5 very long chain fatty acids in plant tissues expressing 
the polynucleotide, compared to VLCFA levels in 
corresponding tissues from an otherwise identical plant 
not expressing the polynucleotide. A comparison can be 
made, for example, between a non- transgenic plant of a 

10 plant line and a transgenic plant of the same plant line. 
Levels of VLCFAs having 20-32 carbons and/or levels of 
VLCFAs having 32-60 carbons can be altered in plants 
disclosed herein* Plants having an altered VLCFA 
composition may be identified by techniques known to the 

15 skilled artisan, e.g., thin layer chromatography or gas- 
liquid chromatography (GLC) analysis of the appropriate 
plant tissue. 

A suitable group of plants with which to practice 
the invention are the Brassica species, including B. 

20 napus, B. rapa f B. juncea, and B. hirta. Other suitable 
plants include, without limitation, soybean {Glycine 
max) , sunflower {Helianthus annuus) and corn (Zea mays) . 

A method according to the invention comprises 
introducing a nucleic acid construct into a plant cell 

25 and producing a plant (as well as progeny of such a 
plant) from the transformed cell. Progeny includes 
descendants of a particular plant or plant line, e.g., 
seeds developed on an instant plant are descendants. 
Progeny of an instant plant include seeds formed on F 1# 

3 0 F 2 , F 3 , and subsequent generation plants, or seeds formed 
on BC X , BC 2 , BC 3 , and subsequent generation plants. 

Methods and compositions according to the 
invention are useful in that the resulting plants and 
plant lines have desirable alterations in very long chain 

35 fatty acid composition. Suitable tissues in which to 
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express polynucleotides and/or polypeptides of the 
invention include, without limitation, seeds, stems and 
leaves. Leaf tissues of interest include cells and 
tissues of the epidermis, e.g., cells that are involved 
5 in forming trichomes. Of particular interest are 

epidermal cells involved in forming the cuticular layer. 
The cuticular layer comprises various very long chain 
fatty acids and VLCFA derivatives such as alkanes, 
esters, alcohols and aldehydes. Altering the composition 

10 and amount of VLCFAs in epidermal cells and tissues may 
enhance defense mechanisms and drought tolerance of 
plants disclosed herein. 

Polynucleotides of the invention can be used as 
markers in plant genetic mapping and plant breeding 

15 programs. Such markers may include RFLP, RAPD, or PCR 
markers, for example. Marker-assisted breeding 
techniques may be used to identify and follow a desired 
fatty acid composition during the breeding process. 
Marker- assisted breeding techniques may be used in 

2 0 addition to, or as an alternative to, other sorts of 

identification techniques. An example of marker-assisted 
breeding is the use of PCR primers that specifically 
amplify a sequence from a desired KAS that has been 
introduced into a plant line and is being crossed into 
25 other plant lines. 

Plants and plant lines disclosed herein preferably 
have superior agronomic properties. Superior agronomic 
characteristics include, for example, increased seed 
gemination percentage, increased seedling vigor, 

3 0 increased resistance to seedling fungal diseases (damping 

off, root rot and the like) , increased yield, and 
improved standability . 

While the invention is susceptible to various 
modifications and alternative forms, certain specific 
3 5 embodiments thereof are described in the general methods 
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and examples set forth below. it should be understood, 
however, that these examples are not intended to limit 
the invention to the particular forms disclosed but, 
instead the invention is to cover all modifications, 
5 equivalents and alternatives falling within the scope of 
the invention. 

EXAMPLES 
Example 1 

Cloning and Expression of FAE1 in Yeast Cells 

10 The open reading frame of the Arabidopsis FAE1 

gene was amplified directly by PCR, using Arabidopsis 
thaliana cv. Columbia genomic DNA as a template, pfu DNA 
polymerase and the following primers: 
5 ' CTCGAGGAGCAATGACGTCCGTTAA-3 ' and 5'- 

15 CTCGAGTTAGGACCGACCGTTTTG-3' . The PCR product was blunt - 
end cloned into the Eco RV site of pBluescript 
(Stratagene, La Jolla, CA) , 

The FAE1 gene was excised from the Bluescript 
vector with BamHl, and then subcloned into the pYEUra3 

20 {Clontech, Palo Alto, CA) . pYEUra3 is a yeast 
centromere -containing, episomal plasmid that is 
propagated stably through cell division. The FAE1 gene 
was inserted downstream of a GAL1 promoter in pYEUra3 . 
The GAL1 promoter is induced when galactose is present in 

25 the medium and repressed when glucose is present in the 
growth medium* 

Insertion of the FAB1 gene in the sense 
orientation was confirmed by PCR, and pYEUra3 / FAE1 was 
used to transform Saccharomyces cerevisiae strain AB1380 

3 0 using a lithium acetate procedure as described in Gietz, 
R. and Woods, R., in Molecular Genetics of Yeast: 
Practical Approaches, Oxford Press, pp. 121-134 (1994) . 
Plasmid DNA was isolated from putative transf ormants, and 



WO 98/54954 



PCT/US98/11384 



-la- 
the presence of the FAEl/ pYEUra3 construct was confirmed 
by Southern analysis. 

Yeast transformed with pYEUra3 having FAEl 
operably linked to the GAL1 promoter were grown in the 
5 presence of galactose or glucose and were analyzed for 
the expression of FAEl. As a control, yeast transformed 
with pYEUra3 containing no insert were also assayed. 
Analysis of such control preparations yielded fatty acid 
compositions and fatty acid elongation rates similar to 

10 those of yeast transformed with pYEUr a3 / FAEl and grown 
with glucose as the carbon source . 

The fatty acid composition of yeast cells grown in 
the presence of galactose was compared to that of cells 
grown in the presence of glucose, to determine if VLCFA 

15 were found in the galactose- induced cells. 

Transformed yeast cells were grown overnight in 
YPD media at 30°C with vigorous shaking. One hundred fil 
of the overnight culture were used to inoculate 40 ml of 
complete minimal uracil dropout media (CM-Ura) 

20 supplemented with either glucose or galactose (2% w/v) . 
Cultures were grown at 30°C to an OD 600 of approximately 
1.3 to 1.5. Cells were harvested by centrifugation at 
5000 Xg for 10 min. Total lipids were extracted from the 
cells with 2 volumes of 4N KOH in 100% methanol for 60 

25 min. at B0°C. Fatty acids were saponified and methyl 
esters were prepared by drying the samples and 
resuspending in 0.5 ml of boron trichloride in methanol 
(10% v/v) . Samples were incubated at 50°C for 15 min in 
a sealed tube. About 2 ml of water was then added and 

30 the fatty methyl esters were extracted thrice with 1 ml 
of hexane. Extracts were dried under nitrogen and 
redissolved in hexane. See Hlousek-Radojcic, A- et al., 
Plant J. 8:803-809. Methyl esters were analyzed on an HP 
5890 series II gas chromatograph equipped with a 5771MSD 

35 and 7673 auto injector (Hewlett-Packard, Cincinnati, OH) . 
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Methyl esters were separated on a DB-23 (J&W Scientific) 
capillary column (30 m X 0 .25 mm X 0 . 25 //m) . The column 
was operated with helium carrier gas and splitless 
injection (injection temperature 280°C, detector 
5 temperature 280°C) . After an initial 3 min. at 100°C, 
the oven temperature was raised to 250° at 20°C min" 1 and 
maintained at that temperature for an additional 3 min. 
The identity of the peaks was verified by 
cochromatography with authentic standards and by mass 

10 spectrometer analysis. 

The results clearly revealed the appearance of 
both 20:1 and 22:1 acyl-CoA products in galactose -induced 
yeast containing the FAE1 coding sequence. Uninduced 
yeast cells failed to accumulated significant amounts of 

15 fatty acids longer than C18. These results indicate that 
expression of FAE1 in yeast resulted in functional KAS 
activity and functional elongase activity. 

Example 2 
FAE1 Activity in Yeast Microsomes 

20 The functional expression of the FAE1 KAS was 

analyzed by isolating microsomes from transformed yeast 
cells and assaying these microsomes in vitro for elongase 
activity. 

Transformed yeast cells were grown in the presence 
25 of either glucose or galactose (2% w/v) as described in 
Example 1. Cells were harvested by centrifugation at 
5000 Xg for 10 min and washed with 10 ml ice cold 
isolation buffer (IB) , which contains 80 mM Hepes-KOH, pH 
7.2, 5 mM EGTA, 5 mM EDTA, 10 mM KC1, 320 mM sucrose and 
30 2 mM DTT) . Cells were then resuspended in enough IB to 
fill a 1.7 ml tube containing 700 fil of 0.5 fxm glass 
beads and yeast microsomes were isolated from the cells 
essentially as described in Tillman, T. and Bell, R., J. 
Biol. Chem. 261:9144-9149 (1986). The microsomal 
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membrane pellet was recovered by centrif ugation at 
252,000 xg for 60 min. The pellet was rinsed by 
resuspending in 4 0 ml fresh IB and again recovered by 
centrifugation at 252000 Xg for 60 min. Microsomal 
5 pellets were resuspended in a minimal volume of IB, and 
the protein concentration adjusted to 2.5 fig /zl* 1 by 
addition of IB containing 15% glycerol. Microsomes were 
frozen on dry ice and stored at -80°C, The protein 
concentration in microsomes was determined by the 

10 Bradford method (Bradford, 1976) . 

Fatty acid elongase activity was measured 
essentially as described in Hlousek-Radojcic, A, et al . , 
Plant J. 8:803-809 (1995). Briefly, the standard 
elongation reaction mix contained 80 mM Hepes-KOH, pH 

15 7.2, 20 mM MgCl 2 , 500 (iM NADPH, 1 mM ATP, 100 uM malonyl- 
CoA, 10 ptM CoA-SH and 15 ^M radioactive acyl-CoA 
substrate. The radiolabeled substrate was either [1* 
14 C]18:l-CoA (50 uCi /imol 1 ) , [r 14 C] 18 : 0-CoA (55 uCi junol" 
1 ) , or [1" 14 C] 16:0-CoA (54 uCi /zmol' 1 ) . The reaction was 

20 initiated by the addition of yeast microsomes (5 pg 
protein) and the mixture incubated at 30° C for the 
indicated period of time. The final reaction volume was 
25 /zl. 

Methyl esters of the acyl-CoA elongation products 
25 were prepared as described in Example 1. Methyl esters 
were separated on reversed phase silica gel KC18 TLC 
plates (Whatman, 250 uM thick) , quantified by 
phosphor imaging, and analyzed on by ImageQuant software 
(Molecular Dynamics, Inc., Sunnyvale, CA) . The detection 
30 ^imit for each product is about 0.001 nanomoles per min. 
per mg microsomal protein, depending on the phosphorimage 
exposure time. 

Results of representative in vitro elongation 
assays are shown in Figs. 1 and 2. The results indicate 
35 that microsomes from galactose- induced cells expressing 
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FAE1 catalyzed multiple cycles of elongation starting 
with either C16:0 acyl CoA, C18:0 acyl CoA, or C18:l 
acyl-CoA as the substrate (Fig. 1). The 16:0 and 18:0 
acyl-CoA substrates were elongated to C26:0 acyl-CoA. In 
5 contrast, the 18:l-CoA substrate was elongated primarily 
to C20:l, with only low levels of C22:l acyl -CoA being 
produced. Occasionally, trace levels of C24:l CoA were 
also observed. Although the chain length of the products 
from the 18:1 acyl-CoA substrate were less than the chain 

10 length from the saturated acyl -CoA substrates, the rate 
of elongation of oleoyl-CoA was about 2- and 3- fold 
higher than the rates of elongation of 16:0 -CoA and 18:0- 
CoA, respectively . 

The elongation activity observed in microsomes 

15 from uninduced cells indicated that there was a low level 
of endogenous elongase activity when 18:l-CoA or 18:0-CoA 
were used as substrates. There was substantial 16:0 -CoA 
elongase activity (10.1 nmol mg protein" 1 at 3 0 min) in 
microsomes from uninduced cells (Fig. 2) . However, the 

20 major product of 16:0 elongation using uninduced 

microsomes was C18:0 acyl CoA, with only small amounts of 
products beyond this length. The elongation of the 16:0 
acyl -CoA substrate presumably is due to an endogenous 
yeast elongase. 

25 Elongation of 18:1 CoA by microsomes from induced 

cells occurred at a rate about 18 -fold higher than in 
microsomes isolated from the uninduced cells (Fig. 2) . 
With microsomes from induced yeast, synthesis of 20:0 CoA 
from the 16:0 CoA substrate, occurred at a rate similar 

30 to that seen when the substrate was 18:0 CoA (4.2 vs. 5.1 
nmol mg protein" 1 ) . The total rate of elongation of [ 14 C] 
16:0-CoA by microsomes from induced cells (15.8 nmol mg 
protein" 1 at 30 min.) was more than 50% higher than 
elongation of [ 14 C] 16:0 -CoA by microsomes from uninduced 

35 cells, suggesting that the FAE1 KAS utilized 16:0-CoA as 
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a substrate in addition to C18-C24 acyl-CoAs. The FAE1 
elongase KAS activity, i.e., the difference in the 16:0 
elongation rates between microsomes from induced and 
uninduced cells, was 5.7 nmol mg protein -1 . The 
5 elongation rate with the 16:0 substrate thus was similar 
to the elongase activity of the FAE1 elongase KAS with 
the 18:0 substrate. 

These results indicate that FAE1 KAS expressed in 
yeast could synthesize 3-ketoacyl-CoA in vitro and, in 
10 combination with yeast reductases and dehydrases, could 
form a functional VLCFA elongase complex. In addition, 
these results suggest that FAE1 is membrane -bound in 
yeast cells. 

Example 3 

15 Cloning and Sequencing of Arabidopsis Elongase Genes 

The sequence of a jojoba seed cDNA (see WO 
93/10241 and WO 95/15387, incorporated herein by 
reference) was used to search the Arabidopsis expressed 
sequence tag (EST) database of the Arabidopsis Genome 

20 Stock Center (The Ohio State University, Columbus, Ohio) . 
The BLAST computer program (National Institutes of 
Health, Bethesda, MD, USA) was used to perform the 
search. The search identified two ESTs (ATTS1282 and 
ATTS3218) that had a high degree of sequence identity 

25 with the jojoba sequence. The ATTS1282 and ATTS3218 ESTs 
appeared to be partial cDNA clones rather than full- 
length clones based on the length of the jojoba sequence. 

A genomic DNA library from Arabidopsis thaliana 
cv. Columbia, was prepared in the lambda GEM11 vector 

30 (Promega, Madison, Wisconsin) and was obtained from Ron 
Davis, Stanford University, Stanford, CA. The library 
was hybridized with ATTS1282 and ATTS3218 as probes and 2 
clones were identified for each EST. Phage DNA was 
isolated from each of the hybridizing clones, the genomic 
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insert was excised with the restriction enzyme Sac I and 
subcloned into the plasmid pBluescript (Stratagene, La 
Jolla, CA) . One cl one from the ATTS1282 hybridization 
was designated ELI and one clone from the ATTS3218 
5 hybridization was designated EL2 . 

A yeast expression library, containing cDNA from 
Arabidopsis thaliana cv. Columbia, was prepared in the 
lambda YES expression vector described in Elledge et al . 
(Elledge, S. et al'. , Proc. Natl. Acad. Sci USA 88:1731-* 

10 1735 (1991) and was obtained from Ron Davis at Stanford 
University, Stanford, CA. The library was hybridized 
with a EL2 partial cDNA probe. A full-length EL2 cDNA 
was not identified. However, the probe did identify a 
full-length cDNA which was designated EL3 . 

15 A consensus sequence for the C- terminal region of 

ELI, EL2 and the jojoba cDNA polypeptides was identified 
by sequence alignment using DNA analysis programs from 
DNAStar, Madison, Wisconsin. This consensus sequence was 
used to search the Arabidopsis EST database again for fi- 

2 0 keto acyl synthase sequences. These searches identified 
four additional putative S-keto acyl synthase ESTs, which 
were designated EL4 through EL7. EL4, EL5, EL6, and EL7 
have homology to Genbank Accession Nos. T04345, T44939, 
T22193 and T76700, respectively. 

25 The lambda YES cDNA expression library described 

above was hybridized with the ELI and EL4-EL7 ESTs as 
probes. This screen identified full-length cDNAs for 
ELI, ELS and EL6. 

The lambda GEM11 genomic library was hybridized 

30 with the EL4 and EL7 ESTs as probes. This screen 

identified full-length genomic clones for EL4 and EL7. 
Phage DNA was isolated from each of the hybridizing 
clones and subcloned into pBluescript as described above. 
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The 7 EL clones were sequenced on both strands 
with regions of overlap for each sequence run. 
Sequencing was carried out with an ABI automated 
sequencer (Applied Biosystems, Inc., Foster City, 
5 California), following the manufacturer's instructions. 

The nucleotide sequences for the coding regions of 
EL1-EL7 are shown in Figs. 3, 5, 7, 9, 11, 13 and 15, 
respectively. The deduced amino acid sequences for EL1- 
EL7 are shown in Figs. 4, 6, 8, 10, 12, 14 and 16, 

10 respectively, using the standard one-letter amino acid 
code. The ELI, EL2 and EL7 genomic clones appeared to 
lack introns. The EL4 genomic clone contained one intron 
near the 5' end of the coding region. 

The nucleotide sequences of the 7 EL 

15 polynucleotides were compared to 5 DNA sequences present 
in Genbank. Genbank, National Center for Biotechnology 
Information, Bethesda, MD. Two of the 5 accessions were 
cloned from members of the Brassicaceae : the Arabidopsis 
FAE1 sequence (Accession U29142) and a Brassica napus 

20 sequence (Accession U50771) . Three of the accessions 
were cloned from jojoba (Simmondsia chinensis) : 2 wax 
biosynthesis genes (Accessions 114084 and 114085) and a 
jojoba KAS gene (Accession U37088) . See also U.S. Patent 
5,445,947, incorporated herein by reference. 

25 Multiple alignment of the 12 sequences was carried . < 

out with a computer program sold under the trade name 
MEGALIGN Lasergene by DNAStar (Madison, Wisconsin) . 
Alignments were done using the Clustal method with 
weighted residue weight table. The nucleotide sequence 

30 similarity index and percent divergence based on the 
multiple alignment algorithm is shown in Table 1. The 
nucleotide sequences of EL1-EL7 are distinguishable from 
the 5 DNA sequences obtained from Genbank. 

The deduced amino acid sequences of the ELI -7 

35 polypeptides were compared with the MEGALIGN program to 
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the deduced amino acid sequences of the same 5 Genbank 
clones, using the Clustal method with PAM250 residue 
weight table. The amino acid sequence similarity and 
percent divergence are shown in Table 2 . The amino acid 
5 sequences of EL1-EL7 polypeptides are distinguishable 
from those of the Genbank sequences . 



WO 98/54954 



26 



PCT/US98/1S384 




WO 98/54954 



27 



PCT/US98/11384 




WO 98/54954 



PCT/US98/U384 



- 28 - 

Example 4 
Expression of ELI and EL2 in Yeast 

The open reading frames (ORFs) for the EL2, EL4 
and EL7 clones were amplified by PCR. The EL2 ORF was 
cloned into XYES using the primers: 

CTCGAGCAAGTCCACTACCACGCA and CTCGAGCGAGTCAGAAGGAACAAA . 
The EL4 ORF was cloned into pYEUra3 using the primers: 
GATAATTTAGAGAGGCACAGGGT and GTCGACACAAGAATGGGTAGATCCAA. 
The EL7 ORF was cloned into pYEUra3 using the primers: 
CAGTTCCTCAAACGAAGCTA and GTCGACTTCTCAATGGACGGTGCCGGA . 
Amplified products were cloned into pYEUra3 under the 
control of, and 3' to, the GAL1 promoter. The resulting 
plasmids were transformed into yeast as described in 
Example 1 . 

Yeast cultures containing full-length ELI in XYES 
and full-length EL2 in pYEUra3 were grown in the presence 
of galactose or glucose as described in Example 2 . 
Microsomes were then prepared from each of the cultures 
and fatty acid elongation assays were carried out as 
described in Example 2 . 

In the first experiment, microsomes were prepared 
from galactose-induced cultures of ELI, EL2 and FAE1, and 
incubated with either [1- 14 C] 18:0 acyl-CoA or [1- 14 C] 18:1 
acyl-CoA as substrate. The amounts of various reaction 
products synthesized after 30 minutes (min) were 
determined as described in Example 2. The results when 
18:0 acyl-CoA was the substrate are shown in Table 3. 
The results when 18:1 acyl-CoA was the substrate are 
shown in Table 4 . 
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The results shown in Tables 3 and 4 indicate that 
the ELI and EL2 gene products have S-ketoacyl synthase 
(KAS) activity and that the KAS reaction product can be 
utilized to form VLCFAs . The specific activities of the 
3 KAS enzymes cannot be compared, since the relative 
amount of the heterologous KAS protein in each microsomal 
preparation is not known. However, the proportions of 
various reaction products can be compared between FAE1, 
ELI and EL2 . 

The data shown in Table 3 indicate that the ELI 
and EL2 KAS activities result in a higher proportion of 
saturated VLCFAs than does the FAE1 KAS activity. These 



SUBSTITUTE SHEET (RULE 26) 



WO 98/54954 



PCT/US98/11384 



- 30 - 

results suggest that ELI and EL2 encode novel gene 
products, because ELI and EL2 have a greater preference 
for C22:0 and C24:0 acyl-CoA substrates than does FAE1 . 

A comparison of the relative elongation activity 
of FAE1 with 18:0 and 18:1 substrates (Tables 3 and 4) 
indicates that FAE1 is more active when 18:1 is the 
substrate than when 18:0 is the substrate. In contrast, 
the overall rate of product formation with ELI is less 
when 18:1 is the substrate than when 18:0 is the 
substrate (Tables 3 and 4) . EL2 is also less active when 
18:1 is the substrate than when 18:0 is the substrate 
(Tables 3 and 4) . These results support the conclusion 
that ELI and EL2 encode novel gene products and suggest 
that ELI and EL2 have a preference for saturated fatty 
acids as substrates, whereas the FAE1 gene product has a 
preference for monounsaturated fatty acids as substrates. 

In a second experiment, microsomes were prepared 
from galactose- induced and from glucose-repressed yeast 
cultures containing ELI or EL2 coding sequences . The 
microsomal preparations were incubated with either 18:0 
acyl-CoA or 18:1 acyl-CoA as substrate and the fatty acid 
reaction products determined as described above. The 
results with the 18:0 substrate are shown in Table 5. 
The results with the 18:1 substrate are shown in Table 6. 
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Table 5. 

Elongation of 18:0-CoA by ELI and EL2 
With and Without Induction of Gene Expression 





S-Keto Acyl S} 


mthase Gene 


Acyl 


ELI 


EL2 


♦Glucose 


♦Galactose 


♦Glucose 


♦Galactose 


Co* 


Rate 1 


(%) 


Race 


(%) 


Rate 


(%) 


Rate 


(%V 


20*0 


0.007 


100.0 


0.074 


55.6 


0.030 


91.3 


0.107 


43.1 


22:0 


0.000 


0.0 


0.023 


17.4 


0.002 


5.1 


0.044 


17.8 


24:0 


0.000 


0.0 


0.020 


15.3 


0.005 


13.6 


0.048 


19. 1 


26:0 


0.000 


0.0 


0.015 


11.5 


0.000 


0.0 


0.050 


20. 0 


Total 


0.007 


100.0 


0.133 


100.0 


0.037 


100.0 


0.249 


100.0 



1 Nanomoles/rainute/tng of aicroiotnal protein 



Table 6. 

Elongation of 18:l-CoA by ELI and EL2 
With and Without Induction of Gene Expression 





S-Keto Acyl Synthase Gene 


Acyl 
CoA 


1 


EL2 


♦Glucoie 


♦Galactose 


♦Glucose 


♦Oalactose 


Rate 1 


(«) 


Rate 


(%) 


Rate 


(%> 


Rate 


<%! 


20:1 


0.062 


100.0 


0.081 


100.0 


0.043 


100.0 


0.089 


100.0 


22:1 


0.000 


0.0 


0.000 


0.0 


o.ooo 


0.0 


0.000 


0.0 


24:1 


0.000 


0.0 


0.000 


0.0 


0.000 


0.0 


0.000 


0.0 


26.1 


0.000 


0.0 


0.000 


0.0 


| 0.000 


0.0 


0.000 


0.0 


Total 


1 0.062 


100.0 


0.081 


100.0 H 0.043 


100.0 


0.089 


100.0 



1 Nanooolea/minute/ng of microaowal protein 



The results in Table 5 show in vitro elongase 
activity for ELI and EL2 under induced (galactose) and 
uninduced (glucose) conditions. The comparison indicates 
that induction with galactose results in a large increase 
in overall elongase activity when 18:0 acyl CoA is the 
substrate (about 19 -fold and 7-fold for ELI and EL2 , 
respectively). In contrast, induction when 18:1 acyl CoA 
is the substrate results in only a small increase in 
elongase activity (about 1.3 -fold and 2 -fold for ELI and 
E12, respectively), as shown in Table 6. 

The results in Table 5 show that little or no 
VLCFA products are made by yeast microsomes under 
uninduced conditions. Upon induction of ELI and EL2 gene 
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expression, however, significant quantities of C20:0, 
C22:0, C24:0 and C26-.0 are made. The data in Tables 5 
and 6 are consistent with the results in Tables 3 and 4, 
which indicated that ELI and EL2 were more active with a 
saturated fatty acid substrate than with a 
monounsaturated substrate. 

The data in Tables 5 and 6 are also consistent 
with the data in Tables 3 and 4 indicating that the ELI 
and EL2 gene products are more active in converting C24:0 
to C26:0 than is FAE1. 

In a third experiment, microsomes from induced and 
uninduced cultures containing ELI or EL2 were incubated 
in the absence of cofactors involved in the S-ketoacyl 
condensation reaction. Cultures were induced and 
microsomes were prepared as described in Example 2 . In 
vitro assays were carried out as described in Example 2, 
except that either ATP, CoASH or both were omitted from 
the enzyme reaction mixture. In addition, one reaction 
was carried out in a complete mixture having 0.01 mM of 
cerulenin (Sigma, St. Louis, MO). Cerulenin is an 
inhibitor of some condensing enzymes. The results are 
shown in Tables 7-9. 
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Table 7. 

Effect of Cof actors on 18:0-CoA Elongation 1 



Gene 


Expt 4 


+Glu 2 


+Gal J 


-ATP 3 


-CoA 3 


-A&C 3 


+ Cer 3 


ELI 


1 


.037 


.109 


.095 


.105 


.119 


.141 


2 


N.D. 


.090 


.125 


.093 


.270 


.176 


EL2 


1 


.033 


.112 


.168 


.127 


.143 


.238 


2 


N.D. 


.120 


.178 


.133 


.195- 


.302 



Activity in nanomoles/minute/mg of microsomal protein. 

2 +Glu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal: microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose- induced cultures. -ATP: ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 

4 Experiment No. 
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Table 8. 

Effect of Cofactors on Elongation Products of ELI 1 



Prod. 


+Glu 2 


+Gal* 


-ATP 3 


-CoA 1 


-A&C 3 


+Cer* 


20:0 


53.9 


46.2 


34.4 


47.8 


41.7 


46.7 


22:0 


14.4 


18.7 


13.7 


18.0 


19.4 


16.2 


24:0 


18.5 


18.1 


20.6 


19.1 


16.7 


17.7 


26:0 


13.2 


17.1' 


31.4 


15.2 


22.3 


19.4 


Total 


100.0 


100. 0 


100.0 


100.0 


100.0 


100.0 



1 Amount of indicated product as a percent of total products formed. 
Results of one experiment for +Glucose; Average of two experiments for 
other conditions. 

2 +Glu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal: microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose- induced cultures. -ATP: ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 
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Table 9. 

Effect of Cofactors on Elongation Products of EL2 1 



Prod. 


+Glu 2 


+Gal 2 


-ATP 3 


-CoA 3 


-A&C 3 


+Cer 3 


20: 0 


54.5 


47.1 


34.1 


45.3 


38.0 


41.8 


22:0 


17.1 


19.1 


16.4 


19.2 


15.9 


16.1 


24:0 


5.8 


19.4 


20.8 


19.9 


18.4 


20.4 


26: 0 


22.6 


14.5 


28.9 


15.8 


27.8 


21.8 


Total 


100.0 


100. 0 


100.0 


100.0 


100.0 


100.0 



Amount of indicated product as a percent of total products formed. 
Results of one experiment for +Glucose; Average of two experiments for 
other conditions* 

2 -fGlu: microsomes from cultures grown in the presence of glucose and 
incubated in standard reaction mix; +Gal: microsomes from cultures grown 
in the presence of galactose and incubated in standard reaction mix. 

3 Microsomes from galactose- induced cultures. -ATP: ATP omitted from 
reaction mix; -CoA: Coenzyme A omitted from reaction mix; -A&C: ATP and 
Coenzyme A omitted from reaction mix; +Cer: Standard reaction mix 
containing 0.01 mM cerulenin. 

The results in Table 7 indicate that omission of 
ATP and/or CoA from the incubation mixture does not have 
a significant effect on the overall amounts of VLCFAs 
synthesized by the in vitro KAS activity of ELI or EL2 . 
The results also show that cerulenin does not inhibit the 
KAS activity of ELI or EL2 « The data in Table 8 and 9 
confirm that ELI and EL2 KAS activity produces 
significant amounts of C24:0 and C26:0 acyl CoA products. 

To the extent not already indicated, it will be 
understood by those of ordinary skill in the art that any 
one of the various specific embodiments herein described 
and illustrated may be further modified to incorporate 
features shown in other of the specific embodiments. 

The foregoing detailed description has been 
provided for a better understanding of the invention only 
and no unnecessary limitation should be understood 
therefrom as some modifications will be apparent to those 
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skilled in the art without deviating from the spirit and 
scope of the appended claims. 
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(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/868,373 

(B) FILING DATE: 03-JUN-1997 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Lundquist, Ronald C 

(B) REGISTRATION NUMBER: 37,875 

(C) REFERENCE /DOCKET NUMBER: 07039/064WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-335-5050 

(B) TELEFAX: 612-288-9696 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATGGATCGAG AGAGATTAAC GGCGGAGATG GCGTTTCGAG ATTCATCATC GGCCGTTATA 60 

AGAATTCGAA GACGTTTGCC GGATTTATTA ACGTCCGTTA AGCTCAAATA CGTGAAGCTT 120 

GGACTTCACA ACTCTTGCAA CGTGACCACC ATTCTCTTCT TCTTAATTAT TCTTCCTTTA 180 

ACCGGAACCG TGCTGGTTCA GCTAACCGGT CTAACGTTCG ATACGTTCTC TGAGCTTTGG 240 

TCTAACCAGG CGGTTCAACT CGACACGGCG ACGAGACTTA CCTGCTTGGT TTTCCTCTCC 300 

TTCGTTTTGA CCCTCTACGT GGCTAACCGG TCTAAACCGG TTTACCTAGT GGATTTCTCC 360 

TGCTACAAAC CGGAAGACGA GCGTAAAATA TCAGTAGATT CGTTCTTGAC GATGACTGAG 420 

GAAAATGGAT CATTCACCGA TGACACGGTT CAGTTCCAGC AAAGAATCTC GAACCGGGCC 480 
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GGTTTGGGAG ACGAGACGTA TCTGCCACGT GGCATAACTT CAACGCCCCC GAAGCTAAAT 54 0 

ATGTCAGAGG CACGTGCCGA AGCTGAAGCC GTTATGTTTG GAGCCTTAGA TTCCCTCTTC 600 

GAGAAAACCG GAATTAAACC GGCCGAAGTC GGAATCTTGA TAGTAAACTG CAGCTTATTC 660 

AATCCGACGC CGTCTCTATC AGCGATGATC GTGAACCATT ACAAGATGAG AGAAGACATC 720 

AAAAGTTACA ACCTCGGAGG AATGGGTTGC TCCGCCGGAT TAATCTCAAT CGATCTCGCT 780 

AACAATCTCC TCAAAGCAAA CCCTAATTCT TACGCTGTCG TGGTAAGCAC GGAAAACATA 84 0 

ACCCTAAACT GGTACTTCGG AAATGACCGG TCAATGCTCC TCTGCAACTG CATCTTCCGA 900 

ATGGGCGGAG CTGCGATTCT CCTCTCTAAC CGCCGTCAAG ACCGGAAGAA GTCAAAGTAC 960 

TCGCTGGTCA ACGTCGTTCG AACACATAAA GGATCAGACG ACAAGAACTA CAATTGCGTG 1020 

TACCAGAAGG AAGACGAGAG AGGAACAATC GGTGTCTCTT TAGCTAGAGA GCTCATGTCT 1080 

GTCGCCGGAG ACGCTCTGAA AACAAACATC ACGACTTTAG GACCGATGGT TCTTCCATTG 1140 

TCAGAGCAGT TGATGTTCTT GATTTCCTTG GTCAAAAGGA AGATGTTCAA GTTAAAAGTT 1200 

AAACCGTATA TTCCGGATTT CAAGCTAGCT TTCGAGCATT TCTGTATTCA CGCAGGAGGT 1260 

AGAGCGGTTC TAGACGAAGT GCAGAAGAAT CTTGATCTCA AAGATTGGCA CATGGAACCT 1320 

TCTAGAATGA CTTTGCACAG ATTTGGTAAC ACTTCGAGTA GCTCGCTTTG GTATGAGATG 1380 

GCTTATACCG AAGCTAAGGG TCGGGTTAAA GCTGGTGACC GACTTTGGCA GATTGCGTTT 144 0 
GGATCGGGTT TCAAGTGTAA TAGTGCGGTT TGGAAAGCGT TACGACCGGT TTCGACGGAG • 1500 

GAGATGACCG GTAATGCTTG GGCTGGTTCG ATTGATCAAT ATCCGGTTAA AGTTGTGCAA 1560 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Asp Arg Glu Arg Leu Thr Ala Glu Met Ala Phe Arg Asp Ser Ser 

15 10 15 

Ser Ala Val lie Arg lie Arg Arg Arg Leu Pro Asp Leu Leu Thr Ser 

20 25 30 

Val Lys Leu Lys Tyr Val Lys Leu Gly Leu His Asn Ser Cys Asn Val 

35 40 45 

Thr Thr He Leu Phe Phe Leu He He Leu Pro Leu Thr Gly Thr Val 

50 55 60 

Leu Val Gin Leu Thr Gly Leu Thr Phe Asp Thr Phe Ser Glu Leu Trp 
65 70 75 80 

Ser Asn Gin Ala Val Gin Leu Asp Thr Ala Thr Arg Leu Thr Cys Leu 

85 90 95 

Val Phe Leu Ser Phe Val Leu Thr Leu Tyr Val Ala Asn Arg Ser Lys 

100 105 110 

Pro Val Tyr Leu Val Asp Phe Ser Cys Tyr Lys Pro Glu Asp Glu Arg 

115 120 125 

Lys He Ser Val Asp Ser Phe Leu Thr Met Thr Glu Glu Asn Gly Ser 

130 135 140 

Phe Thr Asp Asp Thr Val Gin Phe Gin Gin Arg He Ser Asn Arg Ala 
145 150 155 160 

Gly Leu Gly Asp Glu Thr Tyr Leu Pro Arg Gly He Thr Ser Thr Pro 

165 170 175 

Pro Lys Leu Asn Met Ser Glu Ala Arg Ala Glu Ala Glu Ala Val Met 

180 185 190 

Phe Gly Ala Leu Asp Ser Leu Phe Glu Lys Thr Gly He Lys Pro Ala 

195 200 205 

Glu Val Gly He Leu He Val Asn Cys Ser Leu Phe Asn Pro Thr Pro 

210 215 220 

Ser Leu Ser Ala Met He Val Asn His Tyr Lys Met Arg Glu Asp He 
225 230 235 240 

Lys Ser Tyr Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Leu He Ser 

245 250 255 

He Asp Leu Ala Asn Asn Leu Leu Lys Ala Asn Pro Asn Ser Tyr Ala 

260 265 270 

Val Val Val Ser Thr Glu Asn He Thr Leu Asn Trp Tyr Phe Gly Asn 
275 280 285 
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Asp 


Arg 


Ser 


Met 


Leu 


Leu 


Cys 


Asn 


Cys 


He Phe Arg 


Met 


Gly 


Gly 


Ala 




290 










295 






300 








Ala 


He 


Leu 


Leu 


Ser 


Asn 


Arg 


Arg 


Gin 


Asp Arg Lys 


Lys 


Ser 


Lys 


Tyr 


305 










310 








315 








320 


Ser 


Leu 


Val 


Asn 


Val 


Val 


Arg 


Thr 


His 


Lys Gly Ser 


Asp Asp 


Lys 


Asn 










325 










330 






335 




Tyr 


Asn 


Cys 


Val 
340 


Tyr 


Gin 


Lys 


Glu 


Asp 
345 


Glu Arg Gly 


Thr 


He 
350 


Gly 


Val 


Ser 


Leu 


Ala 


Arg 


Glu 


Leu 


Met 


Ser 


Val 


Ala Gly Asp 


Ala 


Leu 


Lys 


Thr 






355 










360 




365 






Asn 


He 
370 


Thr 


Thr 


Leu 


Gly 


Pro 
375 


Met 


Val 


Leu Pro Leu 
380 


Ser 


Glu 


Gin 


Leu 


Met 


Phe 


Leu 


He 


Ser 


Leu 


Val 


Lys 


Arg 


Lys Met Phe 


Lys 


Leu 


Lys 


Val 


385 










390 








395 








400 


Lys 


Pro 


Tyr 


He 


Pro 
405 


Asp 


Phe 


Lys 


Leu 


Ala Phe Glu 
410 


His 


Phe 


Cys 
415 


lie 


His 


Ala 


Gly 


Gly 


Arg 


Ala 


Val 


*Leu 


Asp 


Glu Val Gin 


Lys Asn 


Leu 


Asp 








420 










425 






430 






Leu 


Lys 


Asp 
435 


Trp 


His 


Met 


Glu 


Pro 
440 


Ser 


Arg Met Thr 


Leu 
445 


His 


Arg 


Phe 


Gly 


Asn 


Thr 


Ser 


Ser 


Ser 


Ser 


Leu 


Trp 


Tyr Glu Met 


Ala Tyr 


Thr 


Glu 




450 










455 






460 










Ala 


Lys 


Gly 


Arg 


Val 


Lys 


Ala 


Gly 


Asp 


Arg Leu Trp 


Gin 


He 


Ala 


Phe 


465 










470 








475 








480 


Gly 


Ser 


Gly 


Phe 


Lys 
485 


Cys 


Asn 


Ser 


Ala 


Val Trp Lys 
490 


Ala 


Leu 


Arg 
495 


Pro 


Val 


Ser 


Thr 


Glu 


Glu 


Met 


Thr 


Gly 


Asn 


Ala Trp Ala 


Gly Ser 


He 


Asp 








500 










505 






510 






Gin 


Tyr 


Pro 
515 


Val 


Lys 


Val 


Val 


Gin 
520 















(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1479 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATGGATTACC CCATGAAGAA GGTAAAAATC TTTTTCAACT ACCTCATGGC GCATCGCTTC 60 

AAGCTCTGCT TCTTACCATT AATGGTTGCT ATAGCCGTGG AGGCGTCTCG TCTTTCCACA 120 

CAAGATCTCC AAAACTTTTA CCTCTACTTA CAAAACAACC ACACATCTCT AACCATGTTC 180 

TTCCTTTACC TCGCTCTCGG GTCGACTCTT TACCTCATGA CCCGGCCCAA ACCCGTTTAT 240 

CTCGTTGACT TTAGCTGCTA CCTCCCACCG TCGCATCTCA AAGCCAGCAC CCAGAGGATC 300 

ATGCAACACG TAAGGCTTGT ACGAGAAGCA GGCGCGTGGA AGCAAGAGTC CGATTACTTG 360 

ATGGACTTCT GCGAGAAGAT TCTAGAACGT TCCGGTCTAG GCCAAGAGAC GTACGTACCC 420 

GAAGGTCTTC AAACTTTGCC ACTACAACAG AATTTGGCTG TATCACGTAT AGAGACGGAG 480 

GAAGTTATTA TTGGTGCGGT CGATAATCTG TTTCGCAACA CGGGAATAAG CCCTAGTGAT 540 

ATAGGTATAT TGGTGGTGAA TTCAAGCACT TTTAATCCAA CACCTTCGCT ATCAAGTATC 600 

TTAGTGAATA AGTTTAAACT TAGGGATAAT ATAAAGAGCT TGAATCTTGG TGGGATGGGG 660 

TGTAGCGCTG GAGTCATCGC TATCGATGCG GCTAAGAGCT TGTTACAAGT TCATAGAAAC 720 

ACTTATGCTC TTGTGGTGAG CACGGAGAAC ATCACTCAAA ACTTGTACAT GGGTAACAAC 780 

AAATCAATGT TGGTTACAAA CTGTTTGTTC CGTATAGGTG GGGCCGCGAT TTTGCTTTCT 840 

AACCGGTCTA TAGATCGTAA ACGCGCAAAA TACGAGCTTG TTCACACCGT GCGGGTCCAT 900 

ACCGGAGCAG ATGACCGATC CTATGAATGT GCAACTCAAG AAGAGGATGA AGATGGCATA 960 

GTTGGGGTTT CCTTGTCAAA GAATCTACCA ATGGTAGCTG CAAGAACCCT AAAGATCAAT 1020 

ATCGCAACTT TGGGTCCGCT TGTTCTTCCC ATAAGCGAGA AGTTTCACTT CTTTGTGAGG 1080 

TTCGTTAAAA AGAAGTTTCT CAACCCCAAG CTAAAGCATT ACATTCCGGA TTTCAAGCTC 1140 

GCATTCGAGC ATTTCTGTAT CCATGCGGGT GGTAGAGCGC TAATTGATGA GATGGAGAAG 1200 

AATCTTCATC TAACTCCACT AGACGTTGAG GCTTCAAGAA TGACATTACA CAGGTTTGGT 1260 

AATACCTCTT CGAGCTCCAT TTGGTACGAG TTGGCTTACA CAGAAGCCAA AGGAAGGATG 132 0 

ACGAAAGGAG ATAGGATTTG GCAGATTGCG TTGGGCTCAG GTTTTAAGTG TAATAGTTCA 1380 
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GTTTGGGTGG CTCTTCGTAA CGTCAAGCCT TCTACTAATA ATCCTTGGGA ACAGTGTCTA 144 0 
CACAAATATC CAGTTGAGAT CGATATAGAT TTAAAAGAG 147 9 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Asp Tyr Pro Met Lys Lys Val Lys lie Phe Phe Asn Tyr Leu Met 

1 5 10 15 

Ala His Arg Phe Lys Leu Cys Phe Leu Pro Leu Met Val Ala lie Ala 

20 25 30 

Val Glu Ala Ser Arg Leu Ser Thr Gin Asp Leu Gin Asn Phe Tyr Leu 

35 40 45 

Tyr Leu Gin Asn Asn His Thr Ser Leu Thr Met Phe Phe Leu Tyr Leu 

50 55 60 

Ala Leu Gly Ser Thr Leu Tyr Leu Met Thr Arg Pro Lys Pro Val Tyr 
65 70 75 80 

Leu Val Asp Phe Ser Cys Tyr Leu Pro Pro Ser His Leu Lys Ala Ser 

85 90 95 

Thr Gin Arg lie Met Gin His Val Arg Leu Val Arg Glu Ala Gly Ala 

100 105 110 

Trp Lys Gin Glu Ser Asp Tyr Leu Met Asp Phe Cys Glu Lys lie Leu 

115 120 125 

Glu Arg Ser Gly Leu Gly Gin Glu Thr Tyr Val Pro Glu Gly Leu Gin 

130 135 140 

Thr Leu Pro Leu Gin Gin Asn Leu Ala Val Ser Arg lie Glu Thr Glu 
145 150 155 160 

Glu Val lie He Gly Ala Val Asp Asn Leu Phe Arg Asn Thr Gly He 

165 170 175 

Ser Pro Ser Asp He Gly He Leu Val Val Asn Ser Ser Thr Phe Asn 

180 185 190 

Pro Thr Pro Ser Leu Ser Ser He Leu Val Asn Lys Phe Lys Leu Arg 

195 200 205 

Asp Asn He Lys Ser Leu Asn Leu Gly Gly Met Gly Cys Ser Ala Gly 

210 215 220 

Val He Ala He Asp Ala Ala Lys Ser Leu Leu Gin Val His Arg Asn 
225 230 235 240 

Thr Tyr Ala Leu Val Val Ser Thr Glu Asn He Thr Gin Asn Leu Tyr 

245 250 255 

Met Gly Asn Asn Lys Ser Met Leu Val Thr Asn Cys Leu Phe Arg He 

260 265 270 

Gly Gly Ala Ala He Leu Leu Ser Asn Arg Ser He Asp Arg Lys Arg 

275 280 285 

Ala Lys Tyr Glu Leu Val His Thr Val Arg Val His Thr Gly Ala Asp 

290 295 300 

Asp Arg Ser Tyr Glu Cys Ala Thr Gin Glu Glu Asp Glu Asp Gly He 
305 310 315 320 

Val Gly Val Ser Leu Ser Lys Asn Leu Pro Met Val Ala Ala Arg Thr 

325 330 335 

Leu Lys He Asn He Ala Thr Leu Gly Pro Leu Val Leu Pro He Ser 

340 345 350 

Glu Lys Phe His Phe Phe Val Arg Phe Val Lys Lys Lys Phe Leu Asn 

355 360 365 

Pro Lys Leu Lys His Tyr He Pro Asp Phe Lys Leu Ala Phe Glu His 

370 375 380 

Phe Cys He His Ala Gly Gly Arg Ala Leu He Asp Glu Met Glu Lys 
385 390 395 400 

Asn Leu His Leu Thr Pro Leu Asp Val Glu Ala Ser Arg Met Thr Leu 
405 410 415 
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His 


Arg 


Phe 


Gly 


Asn 


Thr 


Ser 


Ser 








420 










Tyr 


Thr 


Glu 


Ala 


Lys 


Gly 


Arg 


Met 






435 










440 


He 


Ala 


Leu 


Gly 


Ser 


Gly 


Phe 


Lys 




450 










455 




Leu 


Arg 


Asn 


Val 


Lys 


Pro 


Ser 


Thr 


465 










470 






His 


Lys 


Tyr 


Pro 


Val 


Glu 


He Asp 



485 



Ser 


Ser 


He 


Trp 


Tyr Glu 


Leu 


Ala 


425 










430 






Thr 


Lys 


Gly 


Asp 


Arg 
445 


He 


Trp 


Gin 


Cys 


Asn 


Ser 


Ser 
460 


Val 


Trp 


Val 


Ala 


Asn 


Asn 


Pro 


Trp 


Glu 


Gin 


Cys 


Leu 






475 








480 


He 


Asp 
490 


Leu 


Lys 


Glu 









(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CTACGTCAGG 
TTATCTCCAA 
GCAGGATTAG 
CACCATACAC 
TGGATCCTCT 
CTTCCACCGT 
AGAGAAGCAG 
AAGATTCTTG 
TTCCCACTTC 
GCTCTTGACA 
GTGAATTCTA 
AAACTCAGAG 
ATAGCTGTTG 
GTAAGCACAG 
ACAAACTGTT 
CGTAACCGCG 
AGGTCGTTCG 
ACAAAGAATC 
CCTCTTGTAC 
TATTTCAAGC 
TGTATCCACG 
CCGTTACACG 
TCAATCTGGT 
ATTTGGCAGA 
CGAGACGTTA 
GAGATTGATA 



GTAGAACAAA 
TGAAGAACTT 
CCATGAAAGG 
AGAACAACCT 
ACATGTTAAC 
CGCATCTCAA 
GCATGTGTTG 
AACGTTCCGG 
AGCAAGGCAT 
ATCTTTTTCG 
GCACGTTTAA 
ACAACATCAA 
ATGTCGCTAA 
AGAACATCAC 
TCTTCCGCGT 
CCAAATACGA 
AATGTGCGAC 
TACCTATGGT 
TTCCATTAAA 
CAGAGTTAAG 
CTGGTGGAAG 
TAGAGGCGTC 
ACGAGTTAGC 
TTGCTTTGGG 
AGCCTTCAGC 
TT 



GAGTAAACAC 
AAAGATGGTT 
ATCTAAGATC 
CCAAACCATA 
CCGACCTAAA 
GGTCAGTATC 
GAAGAACAAA 
TCTTGGTCAA 
GGGTGCTTCA 
CAACACCGGT 
TCCAACTCCA 
GAGTTTGAAT 
GGGATTACTA 
TCAGAACTTA 
TGGTGGTGCT 
GCTTGTTCAC 
ACAAGAAGAG 
GGCTGCAAGG 
AGAGAAGCTA 
GAATTATACA 
AGCTCTAATA 
AAGAATGACA 
TTATACAGAA 
GTCAGGTTTT 
TAACAGTCCA 



TTAAGCAAAA 
TTCTTCAAGA 
AACGTAGAAG 
AGCCTTCTAT 
CCCGTTTACC 
CAAACCCTAA 
GAGAGCGACC 
GAAACCTACA 
CGTAAAGAGA 
GTAAAACCTG 
TCACTCGCCT 
CTTGGAGGGA 
CAAGTTCATA 
TACTTGGGGA 
GCGGTTCTGC 
ACCGTACGGA 
GATGAAGATG 
ACTCTTAAGA 
GCCTTCTTTA 
CCAGATTTCA 
GATGAGCTGG 
CTACACAGGT 
GCTAAAGGAA 
AAGTGTAACA 
TGGGAAGACT 



CAATTTGTCC 
TCCTCTTTAT 
ATCTCCAAAA 
TGTTTCTTGT 
TTGTTGATTT 
TGGGACACGC 
ATTTAGTTGA 
TCCCCGAGGG 
CGGAAGAAGT 
ATGATATCGG 
CCATGATTGT 
TGGGTTGCAG 
GGAACACTTA 
AAAACAAATC 
TTTCAAACAG 
TCCATACCGG 
GTATAATTGG 
TAAATATCGC 
TTACTTTTGT 
AGCTTGCCTT 
AGAAGAACCT 
TTGGTAACAC 
GGATGAAGGA 
GTTCAGTATG 
GTATGGATAG 



TACTCTTAGG 
CTCTTTAATG 
GTTCTCCCTC 
CGTTTTTGTG 
CTCCTGCTAC 
AAGACGTGCA 
CTTCCAGGAG 
TCTTCAGTGC 
AATCTTCGGA 
TATATTGGTG 
GAACAAGTAC 
TGCCGGAGTT 
TGCTATTGTA 
AATGCTAGTC 
ATCTAGAGAC 
ATCAGATGAT 
AGTTACCTTG 
AACTTTGGGT 
CAAGAAGAAG 
TGAGCATTTC 
TAAGCTTTCT 
TTCTTCTAGC 
AGGAGATAGG 
GGTGGCTCTG 
ATATCCGGTT 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Arg Gin Gly Arg Thr Lys Ser Lys His Leu Ser Lys Thr He Cys 

15 10 15 

Pro Thr Leu Arg Leu Ser Pro Met Lys Asn Leu Lys Met Val Phe Phe 
20 25 30 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1512 
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Lys lie Leu Phe 
35 

Lys lie Asn Val 
50 

Asn Asn Leu Gin 
65 

Trp lie Leu Tyr 

Phe Ser Cys Tyr 
100 

Leu Met Gly His 
115 

Asn Lys Glu Ser 
130 

Arg Ser Gly Leu 
145 

Phe Pro Leu Gin 

Val lie Phe Gly 
180 

Pro Asp Asp He 
195 

Thr Pro Ser Leu 
210 

Asn He Lys Ser 
225 

He Ala Val Asp 

Tyr Ala He Val 
260 

Gly Lys Asn Lys 
275 

Gly Ala Ala Val 
290 

Lys Tyr Glu Leu 
305 

Arg Ser Phe Glu 

Gly Val Thr Leu 
340 

Lys He Asn He 
355 

Lys Leu Ala Phe 
370 

Glu Leu Arg Asn 
385 

Cys He His Ala 

Leu Lys Leu Ser 
420 

Arg Phe Gly Asn 
435 

Thr Glu Ala Lys 
450 

Ala Leu Gly Ser 
465 

Arg Asp Val Lys 

Arg Tyr Pro Val 
500 



lie 


Ser 


Leu 


Met 








40 


Glu 


Asp 


Leu 


Gin 






55 




Thr 


He 


Ser 


Leu 




70 






Met 


Leu 


Thr 


Arg 


D C 






Leu 


Pro 


Pro 


Ser 


Ala 


Arg 


Arg 


Ala 








120 


Asp 


His 


Leu 


Val 






135 




Gly 


Gin 


Glu 


Thr 


150 






Gin* 


Gly 


Met 


Gly 


165 








Ala 


Leu 


Asp 


Asn 


Gly 


He 


Leu 


Val 








200 


Ala 


Ser 


Met 


He 






215 




Leu 


Asn 


Leu 


Gly 




230 






Val 


Ala 


Lys 


Gly 


245 








Val 


Ser 


Thr 


Glu 


Ser 


Met 


Leu 


Val 








280 


Leu 


Leu 


Ser 


Asn 






295 




Val 


Has 


Thr 


Val 










Cys 


Ala 


Thr 


Gin 










Thr 


Lys 


Asn 


Leu 


Ala 


Thr 


Leu 


Gly 








360 


Phe 


He 


Thr 


Phe 






375 




Tyr 


Thr 


Pro 


Asp 




a r\ 

390 






Gly 


Gly 


Arg 


Ala 


405 








Pro 


Leu 


His 


Val 


Thr 


Ser 


Ser 


Ser 








440 


Gly 


Arg 


Met 


Lys 






455 




Gly 


Phe 


Lys 


Cys 




470 






Pro 


Ser 


Ala 


Asn 


485 








Glu 


He 


Asp 


He 
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Ala 


Gly 


Leu 


Ala 


Lys 


Phe 


Ser 


Leu 








60 


Leu 


Leu 


Phe 


Leu 






75 




Pro 


Lys 


Pro 


Val 




90 






His 


Leu 


Lys 


Val 


105 








Arg 


Glu 


Ala 


Gly 


Asp 


Phe 


Gin 


Glu 








140 


Tyr 


He 


Pro 


Glu 






155 




Ala 


Ser 


Arg 


Lys 




170 






Leu 


Phe 


Arg 


Asn 


185 








Val 


Asn 


Ser 


Ser 


Val 


Asn 


Lys 


Tyr 








220 


Gly 


Met 


Gly 


Cys 






235 




Leu 


Leu 


Gin 


Val 




250 






Asn 


He 


Thr 


Gin 


265 








Thr 


Asn 


Cys 


Leu 


Arg 


Ser 


Arg 


Asp 








300 


Arg 


He 


His 


Thr 






315 




Glu 


Glu 


Asp 


Glu 




330 






Pro 


Met 


Val 


Ala 


345 








Pro 


Leu 


Val 


Leu 


Val 


Lys 


Lys 


Lys 








380 


Phe 


Lys 


Leu 


Ala 






395 




Leu 


He 


Asp 


Glu 










Glu 


Ala 


Ser 


Arg 


425 






Ser 


He 


Trp 


Tyr 


Glu 


Gly 


Asp 


Arg 








460 


Asn 


Ser 


Ser 


Val 






475 




Ser 


Pro 


Trp 


Glu 



490 



Met 


Lys 


Gly 


Ser 


45 








His 


His 


Thr 


Gin 


Val 


Val 


Phe 


Val 








80 


Tyr 


Leu 


Val 


Asp 






95 




Ser 


He 


Gin 


Thr 




110 






Met 


Cys 


Trp 


Lys 


125 








Lys 


He 


Leu 


Glu 


Gly 


Leu 


Gin 


Cys 








160 


Glu 


Thr 


Glu 


Glu ' 






175 




Thr 


Gly 


Val 


Lys 




190 






Thr 


Phe 


Asn 


Pro 


205 








Lys 


Leu 


Arg 


Asp 


Ser 


Ala 


Gly 


Val 








240 


His 


Arg 


Asn 


Thr 






255 




Asn 


Leu 


Tyr 


Leu 




270 






Phe 


Arg 


Val 


Gly 


285 








Arg 


Asn 


Arg 


Ala 


Gly 


Ser 


Asp 


Asp 








320 


Asp 


Gly 


He 


He 






335 




Ala 


Arg 


Thr 


Leu 




350 






Pro 


Leu 


Lys 


Glu 


365 






Tyr 


Phe 


Lys 


Pro 


Phe 


Glu 


His 


Phe 








400 


Leu 


Glu 


Lys 


Asn 






415 




Met 


Thr 


Leu 


His 




430 






Glu 


Leu 


Ala 


Tyr 


445 








He 


Trp 


Gin 


He 


Trp 


Val 


Ala 


Leu 








480 


Asp 


Cys 


Met 


Asp 



495 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATGGGTAGAT CCAACGAGCA AGATCTGCTC TCTACCGAGA TCGTTAATCG TGGGATCGAA 60 

CCATCCGGTC CTAACGCCGG CTCACCAACG TTCTCGGTTA GGGTCAGGAG ACGTTTGCCT 120 

GATTTTCTTC AGTCGGTGAA CTTGAAGTAC GTGAAACTTG GTTACCACTA CCTCATAAAC 180 

CATGCGGTTT ATTTGGCGAC CATACCGGTT CTTGTGCTGG TTTTTAGTGC TGAGGTTGGG 240 

AGTTTAAGCA GAGAAGAGAT TTGGAAGAAG CTTTGGGACT ATGATCTTGC AACTGTTATC 3 00 

GGATTCTTCG GTGTCTTTGT TTTAACCGCT TGTGTCTACT TCATGTCTCG TCCTCGCTCT 360 

GTTTATCTTA TTGATTTCGC TTGTTACAAG CCCTCCGATG AACACAAGGT GACAAAAGAA 420 

GAGTTCATAG AACTAGCGAG AAAATCAGGG AAGTTCGACG AAGAGACACT CGGTTTCAAG 480 

AAGAGGATCT TACAAGCCTC AGGCATAGGC GACGAGACAT ACGTCCCAAG ATCCATCTCT 540 

TCATCAGAAA ACATAACAAC GATGAAAGAA GGTCGTGAAG AAGCCTCTAC AGTGATCTTT 600 

GGAGCACTAG ACGAACTCTT CGAGAAGACA CGTGTAAAAC CTAAAGACGT TGGTGTCCTT 660 

GTGGTTAACT GTAGCATTTT CAACCCGACA CCGTCGTTGT CCGCAATGGT GATAAACCAT 720 
TACAAGATGA GAGGGAACAT ACTTAGTTAC AACCTTGGAG GGATGGGATG TTCGGCTGGA 780 
ATCATAGCTA TTGATCTTGC TCGTGACATG CTTCAGTCTA ACCCTAATAG TTATGCTGTT 840 
GTTGTGAGTA CTGAGATGGT TGGGTATAAT TGGTACGTGG GAAGTGACAA GTCAATGGTT 900 
ATACCTAATT GTTTCTTTAG GATGGGTTGT TCTGCCGTTA TGCTCTCTAA CCGTCGTCGT 960 

GACTTTCGCC ATGCTAAGTA CCGTCTCGAG CACATTGTCC GAACTCATAA GGCTGCTGAC 1020 

GACCGTAGCT TCAGGAGTGT GTACCAGGAA GAAGATGAAC AAGGATTCAA GGGGTTGAAG 1080 

ATAAGTAGAG ACTTAATGGA AGTTGGAGGT GAAGCTCTCA AGACAAACAT CACTACCTTA 1140 

GGTCCTCTTG TCCTACCTTT CTCCGAGCAG CTTCTCTTCT TTGCTGCTTT GGTCCGCCGA 1200 

ACATTCTCAC CTGCTGCCAA AACGTCCACA ACCACTTCCT TCTCTACTTC CGCCACCGCA 1260 

AAAACCAATG GAATCAAGTC TTCCTCTTCC GATCTGTCCA AGCCATACAT CCCGGACTAC 1320 

AAGCTCGCCT TCGAGCATTT TTGCTTCCAC GCGGCAAGCA AAGTAGTGCT TGAAGAGCTT 1380 

CAAAAGAATC TAGGCTTGAG TGAAGAGAAT ATGGAGGCTT CTAGGATGAC ACTTCACAGG 1440 

TTTGGAAACA CTTCTAGCAG TGGAATCTGG TATGAGTTGG CTTACATGGA GGCCAAGGAA 1500 

AGTGTTCGTA GAGGCGATAG GGTTTGGCAG ATCGCTTTCG GTTCTGGTTT TAAGTGTAAC 1560 

AGTGTGGTGT GGAAGGCAAT GAGGAAGGTG AAGAAGCCAA CCAGGAACAA TCCTTGGGTG 1620 

GATTGCATCA ACCGTTACCC TGTGCCTCTC 1650 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid . < 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met 


Gly 


Arg 


Ser 


Asn 


Glu 


Gin 


Asp Leu 


Leu 


Ser 


Thr Glu He Val Asn 


1 




5 






10 




15 


Arg 


Gly 


He 


Glu 


Pro 


Ser 


Gly 


Pro Asn 


Ala 


Gly 


Ser Pro Thr Phe Ser 




20 








25 






30 


Val 


Arg 


Val 


Arg 


Arg 


Arg 


Leu 


Pro Asp 


Phe 


Leu 


Gin Ser Val Asn Leu 






35 










40 






45 


Lys 


Tyr 


Val 


Lys 


Leu 


Gly 


Tyr 


His Tyr 


Leu 


He 


Asn His Ala Val Tyr 


50 








55 








60 


Leu 


Ala 


Thr 


He 


Pro 


Val 


Leu 


Val Leu 


Val 


Phe 


Ser Ala Glu Val Gly 


65 










70 








75 


80 


Ser 


Leu 


Ser 


Arg 


Glu 


Glu 


He 


Trp Lys 


Lys 


Leu 


Trp Asp Tyr Asp Leu 










85 








90 




95 


Ala 


Thr 


Val 


He 


Gly 


Phe 


Phe 


Gly Val 


Phe 


Val 


Leu Thr Ala Cys Val 



100 105 HO 
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TSfr* DVia Mat" 
iyi trXlC nCU 


OCX 


Arg 


Pro 


Arg 


oer vax ryr 


Leu 


He Asp 


fne 


Ala 


Cys 


115 














1Z -> 






THr - *- T,v5 Drn 

lyX xjy & ri. v 


OCX 


ASp 


ylu 


nis 


T T/e Val TVi v 

xiys val inr 


Lys 


ulu blU 


rile 


He 


Glu 


130 








X J 






X 4 * u 








ucu rtia fvxy 


xiy a 


OCX 


Hi xr 
Valy 


iiy 0 


l^Ha len ^Tai 




'P V> V T All 

1 ill XJCU 


vjiy 


nit a 

Pne 


Lys 


xt» j 






1 RH 














IOU 


T.Tra Ayn Tlf* 
*jy a y xxc 


T1A11 

XJCU 


f!l n 


al a 

ax a 


Cat* 

OCX 


r?l v Tl P> fJlv 
\jiy xic vixy 


Asp 


OlU llix 


Tyr 


17a 1 

vai 


Pro 






165 






170 
x / u 








1 75 
1 / 0 




Atv* Cat" Tip 


Ser 


OCX 


OCX 


v7X Li 


Acn Tl p Tlry* 

ZiSU XIC X lUL 


xiir 


Ma r* T.vrcs 

net. ijys 


ulu 


Liiy 


Arg 




180 








X O— ' 






1 on 

X5* u 


(Zln f^ln Ala 


Car 

OCX 


inr 


Va 1 
Val 


T 1 A 

lie 


DVt a nl \r Ala 
rTie vaiy Ala 


T All 

iieu 


ASp (jXU 


T All 
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Pne 
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T »»a T Vi t* & T*rr 
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- 
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oci lie rne 
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oer lieu oer 


Ala 
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net vai 
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Asn 


HIS 
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Z JO 
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AX CJ 
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Asn 


lie 


lieu oer lyr 


Asn 


Leu Gly 




Ma4* 

Met 


p T _ _ 

Cjiy 






945 
Z^D 






9 n 

Z O VJ 








Z33 




LyS iici Ala 


n") ir 


Tl 0 

lie 


Tl A 
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x xe Asp iieu 


ax a 


Arg Asp 


Ma4- 

Met 


T Am* 

Leu 


uxn 




9 £ A 














9 T fi 

z / U 






0,a^~ TVbti Dv>a 

ocx ash irto 


Asn 


oer 


Tyr 


Axa 


17a 1 Ua 1 17a 1 

vax vax vax 


Ser 


inr (jiu 


Ma^ 


17a 1 

vai 


pi _ _ 
osxy 


975 

Z I D 










Z 0 V 










Iyi ASn l^p 


lyr 


17a 1 

vax 


oxy 


Ser 


Asp Lys Ser 


Mec 


17a 1 T 1 a 

vai lie 


Pro 


Asn 


Cys 


Z 7U 














^ n n 








rile file Axy 


net 


tjxy 


Cys 


Ser 


2i 1 a 17a 1 Ma+- 

Aia vai nee 


r ... 

x»eu 


Ser Asn 


Arg 


Arg 


Arg 


-3 V D 






J 1 u 






J 13 








*i *5 r» 
oz U 


ASp Pne ATy 


HIS 


Axa 


Lys 


Tyr 


Arg Leu Glu 


xlXS 


T 1 a 17a 1 

lie vai 


Arg 


Thr 


nlS 




















Tic 




T irH Ala Til * 

J_iy S Ala Ala 


ASp 


Asp 


Arg 


oer 


rue Axg ocx 


17a 1 

vai 


- 

xyr Gin 


r»i 11 

vjXU 


/"•In 
ulu 


ASp 




740 




















uiu uin O-Ly 


JrliC 


Lys 


oiy 


_ 

Leu 


jjys lie oer 


Arg 


& on T _ai 1 
ASp bCU 


ncL 


pi 
V?1U 


17a 1 

vax 


355 
j j j 














3fi5 








uiy vjxy uxu 


AT a 

rYXa 


T.ati 


T A. yes 

x>y o 


X ux 


Aan Tla T'V^v 
nSii IXC lilX 


X XIX 


T .am \r 
xjCU ux y 


Dm 
riu 


_ 

i^eu 


Val 
Val 


/ w 








375 






390 

J O V 








T.ah Ot*^ DVia 
LicU Jrx O Ir lie 


C a>* 
Ocx 


m 11 

uJLU 


m rt 

VJlU 


Leu 


T.A11 DY^A 

iieu irne <rne 


Ala 


Ala T Ail 

Ala lieu 


Val 

vax 


Arg 


Arg 


3ft 5 






300 






0 yo 








TKr DVia Cot* 
1 xix xrilc ocx. 




Al a 
Ala 


Al a 
Ala 


xiy s 


TV^i^ Cat* TV^t" 
± IlX OCX 1 XIX 


1X1X7 


T*V\ Cat* 
X 11 x oex 


PVlA 


OCX 


IXlx 






•* w «J 
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"ft X w 
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*± X-> 




O a v* Al a T Vi 
OCX Al Cl lilX 


AT a 


Lys 


llxx 


Asn 


fll Tl A T trc 

biy ne Jjys 


OCX 


Ser ser 


ser 


Asp 
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47(1 












430 






ocx JjyS rlO 


Tyr 


Tl A 

lie 


Pro 


Asp 


iyr iiys lieu 


Jk 1 a 

Aia 


DVt a r^l 11 
rue bill 


illS 


DVi a 

pne 


Cys 


435 










44D 




445 








rile IllS AX 3. 


AXa 


C* a*» 

oer 


Lys 


vax 


vai Leu Liiu 


CjXU 


T ai 1 nT ri 

lieu <jin 


Lys 


Asn 


T All 

iieu 


4 <;n 














A C f\ 








taxy ijeu oer 


VaXU 


LslU 


Asn 


riec 


oiu Aia ber 


Arg 


wet inr 


T All 

iieu 


£11 S 


Arg 


465 






470 






475 








480 


Phe Gly Asn 


Thr 


Ser 


Ser 


Ser 


Gly He Trp 


Tyr 


Glu Leu 


Ala 


Tyr 


Met 






485 






490 








495 




Glu Ala Lys 


Glu 


Ser 


Val 


Arg 


Arg Gly Asp 


Arg 


Val Trp 


Gin 


He 


Ala 




500 








505 






510 






Phe Gly Ser 


Gly 


Phe 


Lys 


Cys 


Asn Ser Val 


Val 


Trp Lys 


Ala 


Met 


Arg 


515 










520 




525 








Lys Val Lys 


Lys 


Pro 


Thr 


Arg 


Asn Asn Pro 


Trp 


Val Asp 


Cys 


He 


Asn 


530 








535 






540 








Arg Tyr Pro 


Val 


Pro 


Leu 

















545 550 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



TCGAGCTACG TCAGGGCTTT TATATGCACA AATTCTCATA AAGTTTTCAA TTTTATTCCA 60 

TTTTTCTCGG AAGCCATGGA AGCTGCTAAT GAGCCTGTTA ATGGCGGATC CGTACAGATC 120 

CGAACAGAGA ACAACGAAAG ACGAAAGCTT CCTAATTTCT TACAAAGCGT CAACATGAAA 180 

TACGTCAAGC TAGGTTATCA TTACCTCATT ACTCATCTCT TCAAGCTCTG TTTGGTTCCA 24 0 

TTAATGGCGG TTTTAGTCAC AGAGATCTCT CGATTAACAA CAGACGATCT TTACCAGATT 300 

TGGCTTCATC TCCAATACAA TCTCGTTGCT TTCATCTTTC TCTCTGCTTT AGCTATCTTT 360 

GGCTCCACCG TTTACATCAT GAGTCGTCCC AGATCTGTTT ATCTCGTTGA TTACTCTTGT 420 

TATCTTCCTC CGGAGAGTCT TCAGGTTAAG TATCAGAAGT TTATGGATCA TTCTAAGTTG 480 

ATTGAAGATT TCAATGAGTC ATCTTTAGAG TTTCAGAGGA AGATTCTTGA ACGTTCTGGT 540 

TTAGGAGAAG AGACTTATCT CCCTGAAGCT TTACATTGTA TCCCTCCGAG GCCTACGATG 600 

ATGGCGGCTC GTGAGGAATC TGAGCAGGTA ATGTTTGGTG CTCTTGATAA GCTTTTCGAG 660 

AATACCAAGA TTAACCCTAG GGATATTGGT GTGTTGGTTG TGAATTGTAG CTTGTTTAAT 720 

CCTACACCTT CGTTGTCAGC TATGATTGTT AACAAGTATA AGCTTAGAGG GAATGTTAAG 780 

AGTTTTAACC TTGGTGGAAT GGGGTGTAGT GCTGGTGTTA TCTCTATCGA TTTAGCTAAA 840 

GATATGTTGC AAGTTCATAG GAATACTTAT GCTGTTGTGG 'TTAGTACTGA GAACATTACT 900 

CAGAATTGGT ATTTTGGGAA TAAGAAGGCT ATGTTGATTC CGAATTGTTT GTTTCGTGTT 960 

GGTGGTTCGG CGATTTTGTT GTCGAACAAG GGGAAAGATC GTAGACGGTC TAAGTATAAG 1020 

CTTGTTCATA CCGTTAGGAC TCATAAAGGA GCTGTTGAGA AGGCTTTCAA CTGTGTTTAC 1080 

CAAGAGCAAG ATGATAATGG GAAGACCGGG GTTTCGTTGT CGAAAGATCT TATGGCTATA 114 0 

GCTGGGGAAG CTCTTAAGGC GAATATCACT ACTTTAGGTC CTTTGGTTCT TCCTATAAGT 1200 

GAGCAGATTC TGTTTTTCAT GACTTTGGTT ACGAAGAAAC TGTTTAACTC GAAGCTGAAG 1260 

CCGTATATTC CGGATTTCAA GCTTGCGTTT GATCATTTCT GTATCCATGC TGGTGGTAGA 1320 

GCTGTGATTG ATGAGCTTGA GAAGAATCTG CAGCTTTCGC AGACTCATGT CGAGGCATCC 1380 

AGAATGACAC TGCACAGATT TGGAAACACT TCTTCGAGCT CGATTTGGTA TGAACTGGCT 1440 

TACATAGAGG CTAAAGGTAG GATGAAGAAA GGAAACCGGG TTTGGCAGAT TGCTTTTGGA 1500 

AGTGGGTTTA AGTGTAACAG TGCAGTTTGG GTGGCTCTAA ACAATGTCAA GCCTTCGGTT 1560 

AGTAGTCCGT GGGAACACTG CATCGACCGA TATCCGGTTA AGCTCGACTT C 1611 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Ser Tyr Val Arg Ala Phe lie Cys Thr Asn Ser His Lys Val Phe 

15 10 15 

Asn Phe He Pro Phe Phe Ser Glu Ala Met Glu Ala Ala Asn Glu Pro 

20 25 30 

Val Asn Gly Gly Ser Val Gin He Arg Thr Glu Asn Asn Glu Arg Arg 

35 40 45 

Lys Leu Pro Asn Phe Leu Gin Ser Val Asn Met Lys Tyr Val Lys Leu 

50 55 60 

Gly Tyr His Tyr Leu He Thr His Leu Phe Lys Leu Cys Leu Val Pro 
65 70 75 80 

Leu Met Ala Val Leu Val Thr Glu He Ser Arg Leu Thr Thr Asp Asp 

85 90 95 

Leu Tyr Gin He Trp Leu His Leu Gin Tyr Asn Leu Val Ala Phe He 

100 105 110 

Phe Leu Ser Ala Leu Ala He Phe Gly Ser Thr Val Tyr He Met Ser 

115 120 125 

Arg Pro Arg Ser Val Tyr Leu Val Asp Tyr Ser Cys Tyr Leu Pro Pro 

130 135 140 

Glu Ser Leu Gin Val Lys Tyr Gin Lys Phe Met Asp His Ser Lys Leu 
145 150 155 160 

He Glu Asp Phe Asn Glu Ser Ser Leu Glu Phe Gin Arg Lys He Leu 

165 170 175 

Glu Arg S r Gly Leu Gly Glu Glu Thr Tyr Leu Pro Glu Ala Leu His 
180 185 190 
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Cys lie Pro Pro Arg Pro Thr Met Met Ala Ala Arg Glu Glu Ser Glu 

195 200 205 

Gin Val Met Phe Gly Ala Leu Asp Lys Leu Phe Glu Asn Thr Lys He 

210 215 220 

Asn Pro Arg Asp He Gly Val Leu Val Val Asn Cys Ser Leu Phe Asn 
225 230 235 240 

Pro Thr Pro Ser Leu Ser Ala Met He Val Asn Lys Tyr Lys Leu Arg 

245 250 255 

Gly Asn Val Lys Ser Phe Asn Leu Gly Gly Met Gly Cys Ser Ala Gly 

260 265 270 

Val He Ser He Asp Leu Ala Lys Asp Met Leu Gin Val His Arg Asn 

275 280 285 

Thr Tyr Ala Val Val Val Ser Thr Glu Asn He Thr Gin Asn Trp Tyr 

290 295 300 

Phe Gly Asn Lys Lys Ala Met Leu He Pro Asn Cys Leu Phe Arg Val 
305 , 310 315 . 320 

Gly Gly Ser Ala He Leu Leu Ser Asn Lys Gly Lys Asp Arg Arg Arg 

325 330 335 

Ser Lys Tyr Lys Leu Val His Thr Val Arg Thr His Lys Gly Ala Val 

340 345 350 

Glu Lys Ala Phe Asn Cys Val Tyr Gin Glu Gin Asp Asp Asn Gly Lys 

355 360 365 

Thr Gly Val Ser Leu Ser Lys Asp Leu Met Ala He Ala Gly Glu Ala 

370 375 380 

Leu Lys Ala Asn He Thr Thr Leu Gly Pro Leu Val Leu Pro He Ser 
385 390 395 400 

Glu Gin He Leu Phe Phe Met Thr Leu Val Thr Lys Lys Leu Phe Asn 

405 410 415 

Ser Lys Leu Lys Pro Tyr He Pro Asp Phe Lys Leu Ala Phe Asp His 

420 425 430 

Phe Cys He His Ala Gly Gly Arg Ala Val He Asp Glu Leu Glu Lys 

435 440 445 

Asn Leu Gin Leu Ser Gin Thr His Val Glu Ala Ser Arg Met Thr Leu 

450 455 460 

His Arg Phe Gly Asn Thr Ser Ser Ser Ser He Trp Tyr Glu Leu Ala 
465 470 475 480 

Tyr He Glu Ala Lys Gly Arg Met Lys Lys Gly Asn Arg Val Trp Gin 

485 490 495 

He Ala Phe Gly Ser Gly Phe Lys Cys Asn Ser Ala Val Trp Val Ala 

500 505 510 

Leu Asn Asn Val Lys Pro Ser Val Ser Ser Pro Trp Glu His Cys He 

515 520 525 

Asp Arg Tyr Pro Val Lys Leu Asp Phe 
530 535 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TCTCCGACGA TGCCTCAGGC ACCGATGCCA GAGTTCTCTA G CTCG GTGAA GCTCAAGTAC 60 

GTGAAACTTG GTTACCAATA TTTGGTTAAC CATTTCTTGA GTTTTCTTTT GATCCCGATC 120 

ATGGCTATTG TCGCCGTTGA GCTTCTTCGG ATGGGTCCTG AAGA GATC CT TAATGTTTGG 180 

AATTCACTCC AGTTTGACCT AGTTCAGGTT CTATGTTCTT CCTTCTTTGT CATCTTCATC 240 

TCCACTGTTT ACTTCATGTC CAAGCCACGC ACCATCTACC TCGTTGACTA TTCTTGTTAC 300 

AAGCCACCTG TCACGTGTCG TGTCCCCTTC GCAACTTTCA TGGAACACTC TCGTTTGATC 360 

CTCAAGGACA AGCCTAAGAG CGTCGAGTTC CAAATGAGAA TCCTTGAACG TTCTGGCCTC 420 

GGTGAGGAGA CTTGTCTCCC TCCGGCTATT CATTATATTC CTCCCACACC AACCATGGAC 480 

GCGGCTAGAA GCGAGGCTCA GATGGTTATC TTCGAGGCCA TGGACGATCT TTTCAAGAAA 540 

ACCGGTCTTA AACCTAAAGA CGTCGACATC CTTATCGTCA ACTGCTCTCT TTTCTCTCCC 600 
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ACACCATCGC TCTCAGCTAT GGTCATCAAC AAATATAAGC TTAGGAGTAA TATCAAGAGC 660 

TTCAATCTTT CGGGGATGGG CTGCAGCGCG GGCCTGATCT CAGTTGATCT AGCCCGCGAC 72 0 

TTGCTCCAAG TTCATCCCAA TTCAAATGCA ATCATCGTCA GCACGGAGAT CATAACGCCT 780 

AATTACTATC AAGGCAACGA GAGAGCCATG TTGTTACCCA ATTGTCTCTT CCGCATGGGT 840 

GCGGCAGCCA TACACATGTC AAACCGCCGG TCTGACCGGT GGCGAGCCAA ATACAAGCTT 900 

TCCCACCTCG TCCGGACACA CCGTGGCGCT GACGACAAGT CTTTCTACTG TGTCTACGAA 96 0 

CAGGAAGACA AAGAAGGACA CGTTGGCATC AACTTGTCCA AAGATCTCAT GGCCATCGCC 1020 

GGTGAAGCCC TCAAGGCAAA CATCACCACA ATAGGTCCTT TGGTCCTACC GGCGTCAGAA 1080 

CAACTTCTCT TCCTCACGTC CCTAATCGGA CGTAAAATCT TCAACCCGAA ATGGAAACCA 114 0 

TACATACCGG ATTTCAAGCT GGCCTTCGAA CACTTTTGCA TTCACGCAGG AGGCAGAGCG 1200 

GTGATCGACG AGCTCCAAAA GAATCTACAA CTATCAGGAG AACACGTTGA GGCCTCAAGA 1260 

ATGACACTAC ATCGTTTTGG TAACACGTCA TCTTCATCGT TATGGTACGA GCTTAGCTAC 132 0 

ATCGAGTCTA AAGGGAGAAT GAGGAGAGGC GATCGCGTTT GGCAAATCGC GTTTGGGAGT 1380 

GGTTTCAAGT GTAACTCTGC CGTGTGGAAG TGTAACCGTA CGATTAAGAC ACCTAAGGAC 144 0 

GGACCATGGT CCGATTGTAT CGACCGTTAC CCTGTCTTTA TTCCCGAAGT TGTCAAACTC 1500 

TA , 1502 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5Q0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Ser Pro Thr Met 

1 

Lys Leu Lys Tyr 
20 

Leu Ser Phe Leu 
35 

Leu Arg Met Gly 
50 

Phe Asp Leu Val 
65 

Ser Thr Val Tyr 

Tyr Ser Cys Tyr 
100 

Phe Met Glu His 
115 

Glu Phe Gin Met 
130 

Cys Leu Pro Pro 
145 

Ala Ala Arg Ser 

Leu Phe Lys Lys 
180 

Val Asn Cys Ser 
195 

lie Asn Lys Tyr 
210 

Gly Met Gly Cys 
225 

Leu Leu Gin Val 

lie He Thr Pro 

260 

Pro Asn Cys Leu 
275 

Arg Arg Ser Asp 
290 



Pro Gin Ala Pro Met Pro Glu Phe Ser Ser Ser Val 

5 10 15 

Val Lys Leu Gly Tyr Gin Tyr Leu Val Asn His Phe 

25 30 
Leu He Pro He Met Ala He Val Ala Val Glu Leu 

40 45 
Pro Glu Glu He Leu Asn Val Trp Asn Ser Leu Gin 

55 60 
Gin Val Leu Cys Ser Ser Phe Phe Val He Phe He 

70 75 80 

Phe Met Ser Lys Pro Arg Thr He Tyr Leu Val Asp 
85 90 95 

Lys Pro Pro Val Thr Cys Arg Val Pro Phe Ala Thr 

105 110 
Ser Arg Leu He Leu Lys Asp Lys Pro Lys Ser Val 

120 125 
Arg He Leu Glu Arg Ser Gly Leu Gly Glu Glu Thr 

135 140 
Ala He His Tyr He Pro Pro Thr Pro Thr Met Asp 
150 155 160 

Glu Ala Gin Met Val He Phe Glu Ala Met Asp Asp 
165 170 175 

Thr Gly Leu Lys Pro Lys Asp Val Asp He Leu He 

185 190 
Leu Phe Ser Pro Thr Pro Ser Leu Ser Ala Met Val 

200 205 
Lys Leu Arg Ser Asn He Lys Ser Phe Asn Leu Ser 

215 220 
Ser Ala Gly Leu He Ser Val Asp Leu Ala Arg Asp 
230 235 240 

His Pro Asn Ser Asn Ala He He Val Ser Thr Glu 
245 250 255 

Asn Tyr Tyr Gin Gly Asn Glu Arg Ala Met Leu Leu 

265 270 
Phe Arg Met Gly Ala Ala Ala He His Met Ser Asn 

280 285 
Arg Trp Arg Ala Lys Tyr Lys Leu Ser His Leu Val 
295 300 



WO 98/54954 



PCT/CS98/ 11384 



- 48 - 



Arg 


inr 


HIS 


Arg 


uly 


Ala 


Asp 


Asp 


Lys 


Ser 


Phe Tyr Cys 


Vai 


Tyr 


Glu 


OAR 










J1U 










jib 




O O A 

320 


Gin 


Calll 


Asp 


i*ys 


GlU 


Gly 


His 


Vai 


Gly 


lie 


Asn Leu Ser 


Lys 


Asp 


Leu 










Jzi> 










1 O A 

330 




335 






A1£L 


ne 


Alel 


v»iy 


CjIU 


Ala 


Leu 


T • rem 

Lys 


Ala 


Asn He Thr 


Thr 


He 


Gly 








1 A ft 










1 A C 






350 




Pro 


Lieu 


vai 


.Leu 


Pro 


Ala 


Ser 


Glu 


Gin 


Leu 


Leu Phe Leu 


Thr 


Ser 


Leu 






-ice 
odd 










O ^ ft 

360 






365 








lie 


pi,, 
uiy 


Arg 


Lys 


lie 


pne 


Asn 


Pro 


Lys 


Trp 


Lys Pro Tyr 


He 


Pro 


Asp 




J / U 


















*3 Q A 






pne 


Lys 


T All 


Aia 


rne 


vjIU 


TT ■ _ 

HIS 


Pne 


Cys 


Tl- 
116 


His Ala Gly 


Gly Arg 


Ala 












^ Qft 
















400 


vai 


lie 


ASp 


blU 


Leu 


(jin 


Lys 


Asn 


Leu 


Gin 


Leu Ser Gly 


Glu 


His 


Vai 










405 










A1 ft 
ft X u 




415 




Glu 


Ala 


Ser 


Arg 
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Ser 


Ser 
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425 
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Ser 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATGGACGGTG CCGGAGAATC ACGACTCGGT GGTGATGGTG GTGGTGATGG TTCTGTTGGA 60 

GTTCAGATCC GACAAACACG GATGCTACCG GATTTTCTCC AGAGCGTGAA TCTCAAGTAT 120 

GTGAAATTAG GTTACCATTA CTTAATCTCA AATCTCTTGA CTCTCTGTTT ATTCCCTCTC 180 

GCCGTTGTTA TCTCCGTCGA AGCCTCTCAG ATGAACCCAG ATGATCTCAA ACAGCTCTGG 240 

ATCCATCTAC AATACAATCT GGTTAGTATC ATCATCTGTT CAGCGATTCT AGTCTTCGGG 300 

TTAACGGTTT ATGTTATGAC CCGACCTAGA CCCGTTTACT TGGTTGATTT CTCTTGTTAT 360 

CTCCCACCTG ATGATCTCAA AGCTCCTTAC GCTCGGTTCA TGGAACATTC TAGACTCACC 420 

GGAGATTTCG ATGACTCTGC TCTCGAGTTT CAACGCAAGA TCCTTGAGCG TTCTGGTTTA 480 

GGGGAAGACA CTTATGTCCC TGAAGCTATG CATTATGTTC CACCGAGAAT TTCAATGGCT 540 

GCTGCTAGAG AAGAAGCTGA ACAAGTCATG TTTGGTGCTT TAGATAACCT TTTCGCTAAC 600 

ACTAATGTGA AACCAAAGGA TATTGGAATC CTTGTTGTGA ATTGTAGTCT CTTTAATCCA 660 

ACTCCTTCGT TATCTGCAAT GATTGTGAAC AAGTATAAGC TTAGAGGTAA CATTAGAAGC 720 

TACAATCTAG GCGGTATGGG TTGCAGCGCG GGAGTTATCG CTGTGGATCT TGCTAAAGAC 780 

ATGTTGTTGG TACATAGGAA CACTTATGCG GTTGTTGTTT CTACTGAGAA CATTACTCAG 840 

AATTGGTATT TTGGTAACAA GAAATCGATG TTGATACCGA ACTGCTTGTT TCGAGTTGGT 900 

GGCTCTGCGG TTTTGCTATC GAACAAGTCG AGGGACAAGA GACGGTCTAA GTACAGGCTT 960 

GTACATGTAG TCAGGACTCA CCGTGGAGCA GATGATAAAG CTTTCCGTTG TGTTTATCAA 1020 

GAGCAGGATG ATACAGGGAG AACCGGGGTT TCGTTGTCGA AAGATCTAAT GGCGATTGCA 1080 

GGGGAAACTC TCAAAACCAA TATCACTACA TTGGGTCCTC TTGTTCTACC GATAAGTGAG 1140 

CAGATTCTCT TCTTTATGAC TCTAGTTGTG AAGAAGCTCT TTAACGGTAA AGTGAAACCG 1200 

TATATCCCGG ATTTCAAACT TGCTTTCGAG CATTTCTGTA TCCATGCTGG TGGAAGAGCT 1260 

GTGATCGATG AGTTAGAGAA GAATCTGCAG CTTTCACCAG TTCATGTCGA GGCTTCGAGG 1320 

ATGACTCTTC ATCGATTTGG TAACACATCT TCGAGCTCCA TTTGGTATGA ATTGGCTTAC 1380 

ATTGAAGCGA AGGGAAGGAT GCGAAGAGGT AATCGTGTTT GGCAAATCGC GTTCGGAAGT 1440 

GGATTTAAAT GTAATAGCGC GATTTGGGAA GCATTAAGGC ATGTGAAACC TTCGAACAAC 1500 

AGTCCTTGGG AAGATTGTAT TGACAAGTAT CCGGTAACTT TAAGTTAT 1548 

(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 516 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Asp Gly Ala Gly Glu Ser Arg Leu Gly Gly Asp Gly Gly Gly Asp 

15 10 15 

Gly Ser Val Gly Val Gin lie Arg Gin Thr Arg Met Leu Pro Asp Phe 

20 25 30 

Leu Gin Ser Val Asn Leu Lys Tyr Val Lys Leu Gly Tyr His Tyr Leu 

35 40 45 

lie Ser Asn Leu Leu Thr Leu Cys Leu Phe Pro Leu Ala Val Val lie 

50 55 60 

Ser Val Glu Ala Ser Gin Met Asn Pro Asp Asp Leu Lys Gin Leu Trp 
65 70 75 80 

He His Leu Gin Tyr Asn Leu Val Ser He He He Cys Ser Ala He 

85 90 95 

Leu Val Phe Gly Leu Thr Val Tyr Val Met Thr Arg Pro Arg Pro Val 

100 105 110 

Tyr Leu Val Asp Phe Ser Cys Tyr Leu Pro Pro Asp His Leu Lys Ala 

115 120 125 

Pro Tyr Ala Arg Phe Met Glu His Ser Arg Leu Thr Gly Asp Phe Asp 

130 135 140 

Asp Ser Ala Leu Glu Phe Gin Arg Lys He Leu Glu Arg Ser Gly Leu 
145 150 155 160 

Gly Glu Asp Thr Tyr Val Pro Glu Ala Met His Tyr Val Pro Pro Arg 

165 170 175 

He Ser Met Ala Ala Ala Arg Glu Glu Ala Glu Gin Val Met Phe Gly 

180 185 190 

Ala Leu Asp Asn Leu Phe Ala Asn Thr Asn Val Lys Pro Lys Asp He 

195 200 205 

Gly He Leu Val Val Asn Cys Ser Leu Phe Asn Pro Thr Pro Ser Leu 

210 215 220 

Ser Ala Met He Val Asn Lys Tyr Lys Leu Arg Gly Asn He Arg Ser 
225 230 235 240 

Tyr Asn Leu Gly Gly Met Gly Cys Ser Ala Gly Val He Ala Val Asp 

245 250 255 

Leu Ala Lys Asp Met Leu Leu Val His Arg Asn Thr Tyr Ala Val Val 

260 265 270 

Val Ser Thr Glu Asn He Thr Gin Asn Trp Tyr Phe Gly Asn Lys Lys 

275 280 285 

Ser Met Leu He 'Pro Asn Cys Leu Phe Arg Val Gly Gly Ser Ala Val 

290 295 300 

Leu Leu Ser Asn Lys Ser Arg Asp Lys Arg Arg Ser Lys Tyr Arg Leu 
305 310 315 320 

Val His Val Val Arg Thr His Arg Gly Ala Asp Asp Lys Ala Phe Arg 

325 330 335 

Cys Val Tyr Gin Glu Gin Asp Asp Thr Gly Arg Thr Gly Val Ser Leu 

340 345 350 

Ser Lys Asp Leu Met Ala He Ala Gly Glu Thr Leu Lys Thr Asn He 

355 360 365 

Thr Thr Leu Gly Pro Leu Val Leu Pro He Ser Glu Gin He Leu Phe 

370 375 380 

Phe Met Thr Leu Val Val Lys Lys Leu Phe Asn Gly Lys Val Lys Pro 
385 390 395 400 

Tyr He Pro Asp Phe Lys Leu Ala Phe Glu His Phe Cys He His Ala 

405 410 415 

Gly Gly Arg Ala Val He Asp Glu Leu Glu Lys Asn Leu Gin Leu Ser 

420 425 430 

Pro Val His Val Glu Ala Ser Arg Met Thr Leu His Arg Phe Gly Asn 
435 440 445 
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Thr Ser Ser Ser 
450 

Gly Arg Met Arg 
465 

Gly Phe Lys Cys 

Pro Ser Asn Asn 
500 

Thr Leu Ser Tyr 
515 



Ser lie Trp Tyr 
455 

Arg Gly Asn Arg 
470 

Asn Ser Ala lie 
485 

Ser Pro Trp Glu 
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Glu Leu Ala Tyr 
460 

Val Trp Gin lie 
475 

Trp Glu Ala Leu 
490 

Asp Cys lie Asp 
505 



He Glu Ala Lys 

Ala Phe Gly Ser 
480 

Arg His Val Lys 
495 

Lys Tyr Pro Val 
510 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide encoding a polypeptide 
having an amino acid sequence selected from the group 
consisting of: an amino acid sequence substantially 
identical to SEQ ID NO : 2 , an amino acid sequence 
substantially identical to SEQ ID N0:4, an amino acid 
sequence substantially identical to SEQ ID NO: 6, an amino 
acid sequence substantially identical to SEQ ID NO: 8, an 
amino acid sequence substantially identical to SEQ ID 
NO: 10, an amino acid sequence substantially identical to 
SEQ ID NO: 12, and an amino acid sequence substantially 
identical to SEQ ID NO: 14. 

2. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 2. 

3. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 4. 

4. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 6. 

5. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 8. 

6. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 10. 

7. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 12. 

8. The polynucleotide of claim 1, wherein said amino 
acid sequence is SEQ ID NO: 14. 
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9. An isolated polynucleotide, wherein said 

polynucleotide is selected from the group consisting of 

a) SEQ ID N0:1; 

b) SEQ ID NO: 3, 

c) SEQ ID NO: 5, 

d) SEQ ID NO: 7; 

e) SEQ ID NO: 9; 

f) SEQ ID N0:11; 

g) SEQ ID NO: 13; 

h) an RNA analog of SEQ ID NOrl, 

i) an RNA analog of SEQ ID NO: 3, 
j) an RNA analog of SEQ ID NO: 5, 
k) an RNA analog of SEQ ID NO: 1 \ 
1) an RNA analog of SEQ ID NO: 9; 
m) an RNA analog of SEQ ID NO: 11; 
n) an RNA analog of SEQ ID NO: 13; 

o) a polynucleotide having a nucleic acid sequence 
complementary to a) ; b) , c) , d) , e) , f ) , g) , h) , i) , j), 
k) , 1) , m) , or n) ; and 

p) a nucleic acid fragment of a) , b) , c) , d) , e) , 
f ) , g) , h) , i) , j), k) , 1), m) , n) , or o) that is at 
least 15 nucleotides in length and that hybridizes under 
stringent conditions to genomic DNA encoding the 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID N0:8, SEQ ID NO:10, SEQ ID NO:12, or SEQ ID N0:14. 



10, An isolated polypeptide having an amino acid 
sequence selected from the group consisting of: an amino 
acid sequence substantially identical to SEQ ID NO: 2, an 
amino acid sequence substantially identical to SEQ ID 
NO: 4, an amino acid sequence substantially identical to 
SEQ ID NO: 6, an amino acid sequence substantially 
identical to SEQ ID NO: 8, an amino acid sequence 
substantially identical to SEQ ID NO: 10, an amino acid 
sequence substantially identical to SEQ ID NO: 12, and an 
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amino acid sequence substantially identical to SEQ ID 
NO: 14 . 



11. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO:2. 



12. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO:4. 

13. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 6. 



14. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 8. 

15. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 10. 

16. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 12. 

17. The polypeptide of claim 10, wherein said amino 
acid sequence is SEQ ID NO: 14. 

18. A transgenic plant containing a nucleic acid 
construct comprising a polynucleotide selected from the 
group consisting of: 

a) SEQ ID NO:l; 

b) SEQ ID NO:3; 

c) SEQ ID NO: 5; 

d) SEQ ID NO: 7; 

e) SEQ ID NO: 9; 

f) SEQ ID NO: 11; 

g) SEQ ID NO: 13; 

h) an RNA analog of SEQ ID NO:l; 
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i) 


an RNA analog of SEQ ID 


NO : 3 ; 


j) 


an RNA analog of SEQ ID 


NO : 5 ; 


k) 


an RNA analog of SEQ ID 


NO : 7 ; 


1) 


an RNA analog of SEQ ID 


NO : 9 ; 


m) 


an RNA analog of SEQ ID 


N0:11; 


n) 


an RNA analog of SEQ ID 


NO: 13; 


o) 


a polynucleotide having 


a nucleic acid sequence 



complementary to a) , b) , c) , d) , e) , f ) , g) , h) , i) , j) , 
k) , 1) , m) , or n) ; and 

p) a nucleic acid fragment of a) , b) , c) , d) , e) , 
f ) , g) , h) , i) , j), k) , 1), m) , n) , or o) that is at 
least 15 nucleotides in length and that hybridizes under 
stringent conditions to genomic DNA encoding the 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID N0:8, SEQ ID NO:10, SEQ ID N0:12, or SEQ ID N0:14. 

19. The plant of claim 18, wherein said construct 
further comprises a regulatory element operably linked to 
said polynucleotide. 

20. The plant of claim 19, wherein said regulatory 
element is a tissue-specific promoter. 

21. The plant of claim 20, wherein said regulatory 
element is an epidermal cell-specific promoter. 

22. The plant of claim 20, wherein said regulatory 
element is a seed-specific promoter that is operably 
linked in sense orientation to said polynucleotide. 

23. The plant of claim 22, wherein said plant has 
altered levels of very long chain fatty acids in seeds 
compared to the levels in a plant lacking said nucleic 
acid construct . 
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24. A transgenic plant containing a nucleic acid 
construct comprising a polynucleotide encoding a 
polypeptide selected from the group consisting of: an 
amino acid sequence substantially identical to SEQ ID 
N0:2, an amino acid sequence substantially identical to 
SEQ ID NO: 4, an amino acid sequence substantially 
identical to SEQ ID NO: 6, an amino acid sequence 
substantially identical to SEQ ID NO: 8, an amino acid 
sequence substantially identical to SEQ ID NO: 10, an 
amino acid sequence substantially identical to SEQ ID 
NO: 12, and an amino acid sequence substantially identical 
to SEQ ID NO: 14. 

25. The plant of claim 24, wherein said construct 
further comprises a regulatory element operably linked to 
said polynucleotide. 

26. The plant of claim 25, wherein said regulatory 
element is a tissue- specific promoter. 

27. The plant of claim 26, wherein said regulatory 
element is an epidermal cell -specific promoter. 

28. The plant of claim 26, wherein said regulatory 
element is a seed-specific promoter that is operably 
linked in sense orientation to said polynucleotide. 

29. The plant of claim 28, wherein said plant has 
altered levels of very long chain fatty acids in seeds 
compared to the levels in a plant lacking said nucleic 
acid construct. 

30. A method of altering the levels of very long chain 
fatty acids in a plant, comprising the steps of: 
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A) creating a nucleic acid construct, said construct 
comprising a polynucleotide selected from the group 
consisting of: 

a) SEQ ID N0:1; 

b) SEQ ID NO: 3; 
C) SEQ ID N0:5; 

d) SEQ ID N0:7; 

e) SEQ ID NO: 9; 

f) SEQ 'iD NO: 11; 

g) SEQ ID NO: 13; 

h) an RNA analog of SEQ ID NOtl, 

i) an RNA analog of SEQ ID NO: 3, 
j) an RNA analog of SEQ ID NO: 5, 
k) an RNA analog of SEQ ID NO: 7; 
1) an RNA analog of SEQ ID N0:9, 
m) an RNA analog of SEQ ID NO: 11; 
n) an RNA analog of SEQ ID NO: 13; 

o) a polynucleotide having a nucleic acid sequence 
complementary to a), b) , c) , d) , e) , f ) , g) , h) , i) , j) , 
k) , 1) , m) , or n) ; and 

p) a nucleic acid fragment of a) , b) , c) , d) , e) , 
f ) , g) / h) , i) , j), k) , 1) , m) , n) , or o) that is at 
least 15 nucleotides in length and that hybridizes under 
stringent conditions to genomic DNA encoding the 
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID N0:8, SEQ ID NO:10, SEQ ID N0:12, or SEQ ID N0:14; and 

B) introducing said construct into said plant, wherein 
said polynucleotide is effective for altering the levels 
of very long chain fatty acids in said plant. 
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